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Preface 


The purpose of this book is to introduce 4th-year or senior undergraduate 
students to what is known as the Standard Model of Particle Physics, the 
model that presently encompasses all of our empirical knowledge about the 
subject. 

Particle physics was in a near-continual state of flux for several decades, 
finally settling down around the mid 1990s when the mass of the Z boson had 
been accurately measured, the number of light quarks and leptons had been 
established, and the top quark had been discovered. The Standard Model has 
since then faced pretty much every experimental challenge to its authority 
with flying colors, and today it stands as the established fundamental theory 
of the non-gravitational interactions, describing all known forms of subatomic 
matter that we have observed. 

The goal of this book is to familiarize students with the Standard Model 
and in so doing, with particle physics in general. It grew out of a one-term 
course I have taught at the University of Waterloo nearly every year over the 
past two decades. It was an interesting course to teach because the subject 
matter would change as particle physics continued to develop, with new results 
coming out from LEP, Fermilab, Super-K, SNO and more on the experimental 
side, and from supersymmetry, string theory, and lattice gauge theory on the 
theoretical side. Students taking the course typically had taken at least one 
course in quantum mechanics (in which they would have seen the solution 
to the hydrogen atom from Schroedinger's equation), one in mathematical 
physics (covering vector calculus, Fourier transforms, and complex functions), 
and had a solid background in special relativity (having encountered the basic 
phenomena of length contraction and time dilation). 

This book assumes that students have a good working knowledge of spe- 
cial relativity, quantum mechanics, and electromagnetism. From this basis 
students who work through the material will develop a solid command of the 
subject and a good working knowledge of the basics of particle physics, in 
terms of mathematical foundations, experimental methods, and basic pro- 
cesses. Each chapter has a number of questions, and there is a solutions 
manual available that has complete answers to all of the questions. 

I have taken the approach of describing the Standard Model in terms of its 
Electromagnetic, Strong, and Weak components, so that students can under- 
stand the subject from the perspective of the reigning paradigm. Throughout 
I have endeavored to show why this paradigm does indeed reign — in other 
words, how and why the different parts of the Standard Model came to be 
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what they are today, particularly pointing out and describing the experiments 
that were essential in arriving at these conclusions. I have also made efforts 
to show where the Model is in need of improvement and what possible physics 
might lie beyond what it describes. This is particularly addressed in the last 
chapter, but also appears throughout the book in a number of places. Our 
understanding of particle physics is by no means a finished project, and I hope 
that students will catch the excitement of the ongoing nature of research in 
this subject. 

Particle physics is both mathematically and conceptually challenging, and 
many have thought that it can only be taught in a very superficial way at 
the undergraduate level, if it is taught at all. In my 20 years of teaching this 
subject I have found that students can indeed rise to the challenge, if both the 
formalism and background are carefully explained to them in a manner that 
allows them to connect with the physics they have already learned. I have 
taken that approach in this book, beginning (after a review of relativity) with 
some basic concepts in group theory and classical mechanics that lead into the 
subjects of symmetries, conservation laws, and particle classification. Three 
chapters following that are devoted to the experimental tools and methods, 
and analysis of particle physics. The next three chapters introduce students 
to Feynman diagrams, wave equations, and gauge invariance, building up to 
the theory of Quantum Electrodynamics. The remainder of the book then 
deals with the three pillars of the Standard Model: QED in Chapters 13 and 
14, the strong interactions and QCD in Chapters 15 — 18, and Electroweak 
interactions in Chapters 19 — 24, with the final chapter devoted to what might 
lie beyond the Standard Model. I have also taken an historical approach to 
the development of the subject wherever possible, showing how it emerged 
from the physics that most students have learned about in other courses. 

The book is designed to be used in a single course over one term, essentially 
twelve weeks of lectures in a three-hour lecture week. Though I would typically 
cover two chapters per week, there is a bit too much in the book for one term, 
and so a few topics inevitably get scant attention. I suggest that students 
read the first chapter on their own, and that instructors need cover only the 
formalism in chapter 2 that may be unfamiliar to students. Instructors may 
also wish to omit the material on the Higgs mechanism in Chapter 23, and 
perhaps the material on QCD in Chapter 18 if time does not permit. 

Theoretical Particle Physics rests on the foundation of Quantum Field The- 
ory (QFT), that subject combining both special relativity and quantum me- 
chanics into a unified whole. I have found that students can learn and make 
use of the basic results of QFT — Feynman diagrams, scattering amplitudes, 
antiparticles, decay processes — without having to go through a full discussion 
of path integrals, Wick's theorem, Interaction pictures, and the like. I have 
avoided the use of the language of quantum fields, preferring to use the term 
wavefunction so that students can make better contact with what they are fa- 
miliar with. Throughout the book I acknowledge the quantum field theoretic 
foundations on which the subject stands where appropriate. My goal is that 
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students see both the forest and the trees, and not get too bogged down in 
formalism. 

That said, the subject is one requiring serious mathematical and intellec- 
tual effort. I have attempted to cater to the more mathematically inclined 
students by putting into appendices mathematically challenging material that 
enriches but is not essential to the understanding of the material in a given 
chapter. Any appendix can be avoided in a first reading of the book, and most 
students will probably wish to do this. However, calculational derivations are 
made explicit wherever possible, and students willing to work through the 
appendices will be rewarded with an enriched understanding of the material 
and a set of formidable technical skills. 

It is my hope that undergraduate students reading this book or taking a 
course that makes use of this book will be inspired by the subject of particle 
physics. I also hope that beginning graduate students may be able to make 
use of the book as preparation for more advanced courses they might take or 
as a resource for basic calculations and background material. I have tended 
to err on the side of completeness in my discussion to ensure that students are 
able to make use of the book in as broad a range of applications as possible. 

This book was written while I was at the University of Waterloo in Ontario, 
Canada, and completed while I was on sabbatical at the Kavli Institute for 
Theoretical Physics at the University of California, Santa Barbara, California 
U.S.A., for whose hospitality I am most grateful. I am also grateful to Don 
Marolf and Martin Einhorn for their efforts in ensuring that I could be hosted 
there. 
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'The most importance resource in particle physics is the Review of Particle 
Physics [1], published every other year since 1960 by the Particle Data Group, 
or PDG, at Berkeley in the United States. T'his document - over 1000 pages in 
length — contains all of the current empirical information that exists about the 
subject, along with the most up-to-date reviews on every aspect of particle 
physics relevant to the discipline. While it is not a place for beginners to 
learn about particle physics, it is truly the bible of the subject insofar as 
the information it contains is concerned. No student nor practitioner of the 
subject should be without it. A summary version appears in booklet form, but 
even that is now over 300 pages. I think a better way to access its information 
is via the PDG Web site: 


http:/ /pdg.lbl.gov/ 


From this Web site you can obtain all known information about any particle 
or process you want with only a couple of clicks of the mouse, along with any 
review article you like. 

If you want to read further in particle physics, I recommend the following 
three books, which I have found particularly helpful in preparing this book: 


A. Bettini, Introduction to Elementary Particle Physics, Cambridge Univer- 
sity Press, 2008. 


D. Griffiths, Introduction to Elementary Particles, 2nd edition, Wiley VCH, 
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University Press, 2000. 
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of them will have substantive parts that are accessible to someone in their 
final year of an undergraduate physics program: 


F. Close, M. Marten, and C. Sutton, The Particle Explosion, Oxford Uni- 
versity Press, 1987. 


P. C. W. Davies and J.R. Brown, Superstrings: A Theory of Everything?, 
Cambridge University Press, 1992. 
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It is remarkable to realize that at the beginning of the 20th century - little 
more than 100 years ago — the structure of the atom was unknown. The 
electron had only been discovered a few years earlier, and its behavior and 
properties were still not well understood. Nobody knew anything about nuclei, 
protons, quarks, neutrinos, photons, gluons, and any of the many subatomic 
particles that we know about today. Quantum mechanics and special relativity 
were unknown conceptual frameworks for describing the physical world. 

As the first decade of the 21st century draws to a close, the world will see 
the Large Hadron Collider (or LHC) at CERN turn on*. The thousands of 
scientists making use of this enormous machine - 27 kilometers in circumfer- 
ence and 24 stories underground — pivot their efforts around a key goal: to 
experimentally observe the Higgs particle and to measure its mass. If this 
experiment is successful, then we will have full empirical confirmation of the 
model — known as the Standard Model - that summarizes everything we know 
about the subatomic world at this point in history. Such confirmation would 
represent both a triumph of the human intellect and a gift of understand- 
ing that would ennoble humankind. Yet if the Higgs particle is not found, 
then the situation will be even more exciting. It will mean that something 
is wrong with our current understanding of particle physics, something that 
will be superseded by - it is hoped — more fundamental knowledge. 

So what is particle physics? Particle physics is the study of nature at the 
most reductionist level possible: it is the study of the ultimate con- 
stituents of matter and the laws governing their interactions. The 
idea that matter ultimately consisted of small indivisible particles is an old 
idea, going back 2500 years to Democritus and Leucippus of Abdera, a town on 
the seacoast of Thrace in Greece [2]. These philosophers proposed that all of 
matter was made of atoyoo, or atoms (a Greek word meaning *uncuttable") 
and empty space. 

This idea survived through the centuries, and was used by scientists such as 
Newton, Dalton, Maxwell, and Mendeleyev to explain the behavior of gases 
and chemical compounds. It grew into the subjects we now call chemistry and 


*The LHC has attracted a lot of attention worldwide, in part because of the fundamental 
questions it addresses and in part because of its large cost. There are many Web sites about 
it with information, novels have been written in which the LHC is a principal setting, a 
movie, Angels and Demons, in which the LHC plays a role, was released in May 2009, and 
there is even a rap (reproduced in Appendix J) about the LHC on YouTube! 


2 An Introduction to Particle Physics and the Standard Model 


physics, each of which has further subdivided into a variety of subdisciplines, 
that in turn have a healthy synergy with one another. 

Particle physics can be regarded as the subdiscipline that pushes the atomic 
idea as far as possible. Simply put, it proceeds from two basic observations 
about our world, common to everyday experience: 


1. Things exist (i.e. there is matter) 


2. Things happen (i.e. interactions occur) 


The goal of particle physics, then, is to reduce to as elementary a level as 
possible our understanding of these two observations. 


1.1 Methods of Study 


One of things that distinguishes particle physics from most other subdisci- 
plines in physics is in its approach to the natural world. In most other subdis- 
ciplines — optics, condensed matter physics, acoustics, biophysics — the basic 
(or effective) physical laws and constituents are known, and one works out 
the consequences of these laws’. However, in particle physics the goal is to 
discover what the laws and constituents are — one cannot take them as given. 

So how does one study particle physics? As with all of science, research 
proceeds on two fronts: experimental and theoretical. Each has a broad range 
of intellectual activity, with theoretical efforts often appearing to be nothing 
more than abstract mathematics, and experimental work seeming at times 
indistinguishable from engineering. Don’t be fooled by superficial appearances 
though! Each of these activities plays a vital role in advancing the subject, 
and the two approaches have a healthy and vibrant interplay. Conceptually we 
can categorize each approach, as summarized in table 1.1. The two columns 
in table 1.1 form the primary conceptual categories in each area. There is a 
lot of overlap both vertically down the columns and horizontally across the 
rows. Let’s look briefly at each category. 


1.1.1 Large Accelerators 


Most of our experimental knowledge of particle physics comes from colliding 
particles together at very high speeds, resulting in very energetic collisions. 
For this reason, particle physics is sometimes called high energy physics. Very 


t'This by no means makes such subdisciplines less intellectually challenging, less valuable, 
or less important. Indeed, they have led to an understanding of many novel phenomena 
and applications, including vortices, superfluids, photonic band gaps, and more. 
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TABLE 1.1 
Approaches to Particle Physics 
Experimental Theoretical 
Large Accelerators Empirical Analysis 
Detectors Model Building 
Precision Measurements Numerical Computation 
Cosmological Data Mathematical Foundations 


large machines called accelerators —kilometers in length — are needed to do 
this [3]. When we wish to examine very tiny systems (i.e. very short dis- 
tances and/or very short times) we must cope with limitations imposed by 
the uncertainty principle : 


ApAzx > h => Ap > h/Ar => Apis large for small Az 
AEAt > h= AE > h/At= AE is large for small At 


Also, since from relativity E = mc?, large mass particles need high energies 
to be created. Hence we need accelerators that can attain very high energies 
in order to study such short distance effects. In order to implement this, 
the accelerators need to be quite large in size — we'll see why in Chapter 7. 
The LHC at CERN is currently the largest such machine in the world, and is 
capable of accelerating particles to almost the speed of light [4]. 


1.1.2 Detectors 


It does no good to smash particles together unless you can see what happens. 
A detector is a machine designed to do just that. There are many kinds of 
detectors, as we'll see in Chapter 8, with the main job of each one being that 
of measuring as much physical information about the particles emerging from 
a collision as possible: their momenta, their masses, their spins, their charges, 
their energies, and so on. These detectors are typically of enormous size — the 
ATLAS detector at CERN is as high as a 5-story building (see figure 1.1) — 
because of the large amount of sophisticated apparatus needed to ensure that 
all of these measurements can take place. 

However, large detectors are not the only kinds of detectors employed in 
particle physics, nor are all detectors deployed in high-energy collision exper- 
iments. This brings us to our next approach. 


1.1.3 Precision Measurements 


Not all of what we know about particle physics comes from smashing par- 
ticles together. Sometimes we need to measure very subtle properties about 
particles that cannot be observed in high-energy collisions. For example inter- 
actions of neutrinos with other kinds matter (electrons, nuclei) do not require 
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FIGURE 1.1 


Installation of the beam pipe in the Atlas detector at the LHC in June 2008. 
(Photograph: Maximilien Brice, copyright CERN; used with permission). 


high energies. Furthermore, they are very infrequent and unlikely to occur. 
Hence there is a need for sensitive detectors to pick out the signal from the 
noisy background of the everyday world. The Sudbury Neutrino Observa- 
tory was an example of a large-scale precision measurement facility designed 
to detect the properties of solar neutrinos [5]. Other experiments — searches 
for dark matter, axions, and other exotic phenomena — employ detectors of 
all shapes and sizes, custom-made to seek out (or place limits on) the phe- 
nomenon of interest. 


1.1.4 Cosmological Data 


The early universe was an environment of a hot plasma of all kinds of particles 
[6]. The average temperature — and hence the average collision energy — 
was very high, much higher than can be attained in controlled terrestrial 
experiments. This means that observations from cosmology can provide us 
with useful and important information about particle physics. An example of 
this was a cosmological limit on the number of kinds of low-mass neutrinos, 
which had to be less than 4 from the corroboration of Big Bang nucleosynthesis 
with observation. The limit was later confirmed by experiments on the Z- 
particle [7], which showed that there were only three kinds of neutrinos that 
were lighter than half the mass of the Z. 

It is common today for particle physicists and cosmologists to interact and 
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FIGURE 1.2 

View of the Sudbury Neutrino Observatory detector after installation of the 
bottom photomultiplier tube panels, but before cabling (Photo courtesy of 
Ernest Orlando Lawrence Berkeley National Laboratory; used with permis- 
sion). 
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collaborate with one another, with findings from each subdiscipline shedding 
light on findings from the other. In fact, astroparticle physics has pretty much 
become a separate subdiscipline of its own, with a community of theorists 
and experimentalists actively seeking to further our understanding of this 
interesting interdisciplinary subject [8]. 


1.1.5 Empirical Analysis 


You might expect that a key job of a particle physicist is to analyze the data, 
and you would be right. The experimentalists do this first, converting raw 
data into usable information, such as measurements of the masses or lifetimes 
of particles. Theorists make use of this information to seek new patterns in 
the data, to critique existing analysis, and to suggest new experiments. 

'The analysis of the data itself makes use of a variety of mathematical tech- 
niques of some sophistication, and today typically require vast amounts of 
computer processing. The LHC will produce a data volume of 1 trillion bytes 
per second, equivalent to 10,000 sets of an Encylopedia Brittanica each second 
[4]. During its expected lifespan the LHC should produce an amount of data 
equivalent in volume to that contained in all of the words ever spoken by 
humankind in its existence on earth. Such an enormous volume of data per 
unit time must be supplemented by a large computational infrastructure, as 
well as a very sophisticated level of data processing and programming skill. 


1.1.6 Model Building 


A very common activity for a particle theorist is to propose a model for 
how nature works at the subatomic level. 'This involves making a clear set of 
assumptions about the particle content, the interactions between the particles, 
and the basic symmetries respected by each, all with an eye toward making 
a falsifiable prediction that an experimentalist could check. For example, 
a theorist might suggest that electrons and muons are themselves made of 
simpler particles that bind together according to some new force’. 

The difference between a theory and a model is often confusing to new- 
comers to the subject. The distinction between the two is rather subtle, and 
perhaps can best be understood in the following way. A theory is a basic 
mathematical framework used for describing physics. Quantum mechanics, 
Yang-Mills theory, and Special Relativity are all examples of these. A model 
is a particularization of a theory to a specific context — it still very much has 
a mathematical character, but also has a specificity designed to describe a 
particular system or situation. For example the quark model is a particular 


*Such models were indeed proposed, with the constituents known by names such as preons 
and rishons, and became known as substructure models [9]. 
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description of the underlying structure of particles such as the proton, neu- 
tron, and pion (more generally of all hadrons), that makes use of quantum 
mechanics, group theory, and Yang-Mills theory to elucidate its key features. 


1.1.7 Numerical Computation 


Computers have gone from playing a supplementary role in analyzing data 
to an essential role in working out the consquences of physical theory. Many 
of the problems in particle physics cannot be analyzed from a theoretical 
standpoint without the use of computers. The calculations are simply too 
big or too long for any person (or group of persons) to carry out in a reasonable 
amount of time. 

Lattice gauge theory is a good example [10]. In this approach to understand- 
ing the behavior of quarks and gluons, many theorists work on attempting to 
solve the basic equations of Quantum Chromodynamics (QCD) on a com- 
puter, where spacetime is approximated as a lattice of discrete points. The 
goal here is to solve the equations with as few approximations as possible, 
something that has eluded formal theoretical analysis thus far. 


1.1.8 Mathematical Foundations 


'This type of work involves a basic exploration of the mathematical structure 
of particle physics and its models. It is highly mathematical, and involves 
examining the basic foundations of current theory, as well as its proposed 
extensions. Here the theorist attempts to prove/refute certain properties of 
broad classes of models, with secondary regard as to their empirical content. 

String theory is perhaps the best-known example of this kind of work [11]. 
Over the past 25 years it has given birth to new mathematical methods, new 
conceptual frameworks, and new calculational techniques in particle physics, 
a number of which could have interesting implications for the subject in the 
years to come. 


1.1.9 Units 


Particle physics is commonly concerned with understanding highly energetic 
processes at very short distances. This is a regime where special relativity 
and quantum mechanics are both important, as noted earlier, and so both 
Planck's constant Á and the speed of light c, which have the values [1] 


Hh = 1.05457266 x 10734 Js 
c = 2.99792458 x 108 ms”! 


must be taken into account. Retention of these constants in every expression 
can often be a cumbersome nuisance, so most particle physicists prefer to 
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work in what are called natural units where h = E = 1 and c = 1. This also 
allows one to set the permittivity of free space, £y = 1, provided all charges 
are rescaled in units of (he) Y 2. 1 will typically adopt these conventions, 
except in certain cases where it is useful to illustrate the explicit units. This 
will typically be when I display a result that can be directly compared to 
experiment (such as a decay rate or a cross-section), in which case the factors 
of A and c are useful. 


With a bit of practice it is not hard to convert an expression in natural 
units to one with the proper powers of i and c. The general prescription 
for any given expression is to (a) express all velocities as a fraction of the 
speed of light and all times in terms of the light-travel distance (b) convert 
distances into units of inverse energy (or vice versa) as appropriate, using 
the conversion factor hc = 197 MeV-fm, and (c) express charges, masses and 
momenta in units of energy. 


TABLE 1.2 
Natural Units 


Physical Quantity Notation Units | Natural Physical 


velocity 16) unitless pa 
time t h/MeV t>ot/h 
length d hc/MeV d — d/hc 
mass m MeV/c? m — mc? 
momentum p MeV/c p pe 
charge q unitless q> Æ 
energy E MeV 


So for a given expression that depends on mass, time, momentum, energy 
and charge, to convert it to standard units just apply the conversion factors 
in the right-hand column of table 1.2. The resultant expression will be in 
terms of physical quantities with respective units of kilograms, seconds, kg- 
meters/second, Joules, and electrostatic units. There will also be a number of 
factors of h c that will cancel out to leave an appropriate resultant expression. 


For example in natural units the Compton wavelength A = 1/m. To convert 
this to physical units we set A — A/he and m — mc?, giving à — A/he = 
1/mc? or A = h/me, which is the standard formula. 
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1.2 Overview 
'The picture of particle physics circa 1940 was that everything in the universe 


was made of 4 particles: 


the electron 
the proton 


the neutron 


E 


the neutrino 


All known chemical elements were made of the first three of these in some 
combination, and it was generally believed at that time that all known ex- 
traterrestrial matter — stars, planets, and interstellar dust — were made of 
the same three particles. The last of these, the neutrino, was a hypothetical 
particle needed to ensure that radioactive processes respected conservation of 
energy and angular momentum, but had no direct observational confirmation 
at that time. 

The following 60 years saw a radical modification in our understanding of 
the subatomic world. As experimental energies increased, hundreds of new 
particles (almost all of them unstable) were discovered. For a period of time 
the subject was in a considerable amount of confusion, but by the end of 
the 1970s a general understanding of the situation had emerged, along with 
a model — now called the Standard Model — that could describe all current 
knowledge of the subject [12]. 


1.2.1 Bosons and Fermions 


Today we know that all matter and its physical interactions can be described 
in terms of two basic kinds of particles: bosons and fermions.  Bosons are 
particles of integer spin in units of Planck’s constant f — they are the ele- 
mentary particles that govern what we describe as a force in the everyday 
world. Fermions are particles of half-integer spin in the same units, and are 
the elementary constituents of what we call matter. 

Bosons and fermions are distinguished by their collective properties under 
the interchange of two particles. Suppose we have a system consisting of two 
identical particles. If we enclose the system in a box, then the probability of 
finding one particle in a given position and the other in another position — 
let's call this P(1,2) — must be equal to the probability of finding the particles 
interchanged in position (in other words we have P(1,2) — P(2,1)) because 
they are identical and so we can't tell them apart. Since quantum mechanics 
implies that probabilities are given by square of wavefunction amplitudes, we 
have 

P(1,2) = P(2,1) > |v(1,2)? = |w(2,1)? (1.1) 
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However, this doesn't mean that the wavefunction V(1,2) = W(2,1). Instead 
we have the more general possibility that W(1,2) = e'?W(2, 1) where ¢ is some 
phase. Applying the switch again would give V(1,2) = e?/^vw(1,2), implying 
that et? = +1, or 


V (2,1) boson 


0/25 { —W(2,1) fermion (12) 


and we mathematically define bosons to be particles whose wavefunctions 
maintain sign under particle interchange, whereas fermions are particles whose 
wavefunctions flip sign under particle interchange. The spin-statistics theorem 
states that all fermions have half-integer spin and all bosons have integer spin 
[13]. In this text I will assume as valid the conditions that render the theorem 
true’. 

Note that indistinguishable means just that — two elementary particles of 
the same type are perfect duplicates of one another. It is simply not possible 
by any measurement we can make to tell one electron apart from another, 
or put a label on one v* to distinguish it from another 7+. This property 
of elementary particles is unlike anything in our everyday experience in the 
macroscopic world, where we are used to things that are similar — such as 
identical twins, or computers off of an assembly line — but not exactly the 
same. The elementary microscopic constituents of our universe are huge in 
number — about 109? particles in all — but are of only 38 elementary types 
(including antiparticles), as we shall see shortly. It could have been otherwise, 
in which case it is hard to imagine how a coherent physical description of the 
universe would be possible. Just imagine trying to construct a theory with 
109? different kinds of particles, each of which had its own distinct properties! 

Of course the existence of these 38 elementary types is our state of knowl- 
edge at the present time, and we now know that it is almost certainly incom- 
plete. Over the past three decades we have discovered from observations in 
cosmology and astronomy that only 496 of the total energy of the universe 
is made of known matter (ie. the matter that makes up the elements in 
the periodic table). Another 23% of the this energy budget is dark matter, 
whose presence is known to us only by the gravitational attraction it exerts 
on galaxies and clusters of galaxies [6]. Its composition in terms of elementary 
particles remains unknown to us at this point in history. The remaining part 
of the energy — 7396 — is called dark energy, which is even more mysterious 
since it is causing the universe to accelerate in its expansion, whereas ordi- 
nary mass/energy (and the dark matter) would exert a decelerating influence. 
The ultimate composition of this form of energy is not at all clear, though 
the simplest explanation would appear to be that it is the vacuum energy 


ŠA generalization of the result that all bosons have integer spin and all fermion half-integer 
spin (in units of A) occurs in theories with only two space dimensions. In this case it is 
possible to have particles that have any possible spin, and such particles are called *anyons." 
These kinds of wavefunctions have applications in condensed matter physics [14]. 
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of the universe, ie. the ground-state energy of the aggregate of all bosons 
and fermions [15]. The problem with this interpretation is that although the 
cosmological vacuum energy is the largest fraction of the total energy of the 
universe (about 3/4), theoretical calculations indicate that it should be much 
larger than the value we observe — about 101?° times larger! 

Nobody knows how to resolve the puzzles of dark energy and dark matter, 
and much effort is currently being expended by cosmologists, astronomers, 
and particle physicists to find out what these things are and how they behave. 
For the most part I will ignore these interesting issues thoughout most of this 
textbook, concentrating on elucidating the structure of the 4% of matter that 
we do know something about, and which is described by the Standard Model 
of particle physics. But there is one thing that we can be fairly confident 
about — whatever the dark stuff is, its elementary constituents will be bosons 
and/or fermions. 


1.2.2 Forces 


As far as experiment has been able to tell us, all known interactions in the 
world are governed by some combination of four basic forces: gravity, elec- 
tromagnetism, nuclear (called the strong force), and radioactive (called the 
weak force). These forces have very different properties and manifestations, 
as illustrated in table 1.3. 

Note that each force is associated with something called a mediator. What 
does this mean? Suppose we have a source of electric charge Q and we want to 
know what force a small test charge q experiences in its vicinity. In classical 
physics the answer is well known: we express the effect of Q on q in terms of 
something called an electric field E: 


Fon g= qE = oi +--+ = (monopole) + (dipole) +(quadrupole, etc.) 
TT 


which we say is due to the source Q. It is a vector, each component of which 
is a continuous function of the distance r. Of course the electric field itself is 
not directly observable; only the force F is. 
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This classical notion of force is modified at short distances due to quantum 
effects [16]. As Planck and Einstein noted in the last century, in order to 
describe certain phenomena (such as blackbody radiation or the photoelectric 
effect) the electromagnetic field should be “lumpy” — that is, it should come 
in distinct quanta called photons. In this picture, the source Q influences q 
by exchanging a photon with it, as shown in figure 1.3. 


"AF d p 
X 


FIGURE 1.3 

'The left side is the classical picture of forces and fields, in which an element 
of charge Q a displacement 7 away exerts a force F ona test charge q. The 
right side is the quantum picture, in which the force is mediated by photons 
exchanged between Q and q, here represented by the dashed line. 


We say that the exchanged photon is virtual, which means that it is not 
directly observed. But this means that we shouldn’t see it transmit any net 
energy or momentum. In other words, the exchange of this virtual photon can 
only satisfy the requirements of energy and momentum conservation provided 
that the energy AE and momentum AP exchanged in the process cannot be 
detected. This will be true if these quantities are bounded within the limits 
of what the uncertainty principle permits (so that any putative detection of 
the photon is washed out by quantum uncertainty). Specifically this means 
that 

AEAt<h and | Ap||AT|<h (1.3) 


where 


At — r/c — time scale for photon exchange to take place (1.4) 


|A7’| =r = distance scale for photon exchange to take place 


So the force experienced by q due to N virtual photons each transferring 
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momentum AP is 


F= NAP = vere = Nhe“, = Bb; (1.5) 


provided we normalize N = Qq/(4rhc) — in other words, the number of 
virtual photons emitted/absorbed should be proportional to the product of 
the charges. In this sense the photon mediates the electromagnetic force! 

'This concept of a mediator is how all forces are understood in the context 
of modern particle physics. The basic idea is that any two subatomic particles 
exert forces on each other by exchanging other virtual quanta of subatomic 
particles — the mediators. 

There are 12 mediators for all the non-gravitational forces: 1 photon, 8 
gluons, and 3 weak vector bosons, more commonly referred to as the W*, 
W” and the Z. So far, experiment has indicated that these mediators have 
helicity 1 (ie. integer spin A). This is important, since our descriptions of 
bosons entail the following properties of bosons listed in table 1.4. Pm not 


TABLE 1.4 
Attractive/Repulsive Character of Forces 


ODD-INTEGER SPIN PARTICLES: mediate forces that are 
both attractive and repulsive 


EVEN-INTEGER SPIN PARTICLES: mediate forces that are 
either attractive or repulsive 


going to prove the results in table 1.4 here — they can be demonstrated from 
some basic properties in quantum field theory. What this means is that since 
all non-gravitational forces are mediated by spin-1 bosons, they all have both 
an attractive and a repulsive character. 

What if we include gravity? The fact that gravity is always attractive means 
that it must be mediated by an even-integer spin particle; we call this particle 
the graviton and it has spin-2. The Higgs boson (should it be found) has 
spin-0 and so will have a purely attractive character as well. This will add 
two more elementary particle types to the list, for a total of 14 out of the 38 
elementary types?. 


TIt could be argued that I shouldn't include these two because, strictly speaking, we don't 
have direct observational confirmation of their existence. However, there is little doubt 
that the graviton is present, and the Higgs particle is foundational to our understanding 
of the Standard Model. I may be skating on thin ice with this one though: more Higgs 
particles than one may be discovered at the LHC or, more radically, no Higgs particle may 
be discovered. 
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1.2.3 Matter 


So far we have been exploring bosons: the things that mediate forces. Now 
let's look at particles: the stuff that things are made of. 

All known matter (excluding dark matter and dark energy) is made of spin- 
1/2 particles called fermions that have anticommuting statistics: 


W(1,2) = —w(2,1) 


where (1,2) is the wavefunction for a system of two identical fermions. The 
proton and the electron are the best known of these, with the electron being 
the first fermion to be discovered (in 1897) [17]. The nucleus was discovered 
by Rutherford in 1910 [18], but it was not until 1932 that the nucleus was 
understood to be a composite object consisting of two kinds of fermions, 
known as protons and neutrons. 

Uhlenbeck and Goudsmit postulated that the electron had spin h/2 [19] in 
order to properly account for the Zeeman effect [20] and the Stern-Gerlach 
experiments [21]. This meant that Schroedinger’s wave equation could not 
be used to describe the behavior of electrons, since it did not have take spin 
into account. In 1927 Dirac wrote down a relativistic wave equation that 
he required to be LINEAR in E and P (ie. linear in the operators 0/0t 
and V. This equation predicted that a charged particle of —e had spin h/2. 
Remarkably, it also predicted that there was another particle of spin h/2 with 
identical mass but charge +e. This particle is the antiparticle of the electron 
and is called the positron. Just as remarkably, it was discovered by Anderson 
in 1933. 

All spin-/1/2 (or spin-1/2) particles obey Dirac's equation; you just adjust 
the mass and charge in the equation to describe the particle of interest. Hence 
each spin-1/2 particle has a corresponding antiparticle. This is also true for 
most bosons, but for some bosons (the photon being the best known example) 
the antiparticle is the particle itself ll. In general all quantum numbers of an 
antiparticle are the negative of that of its corresponding particle except for 
its mass, which remains unchanged. 

As far as experiment has indicated, all spin-1/2 particles in the world come 
in two basic types, leptons and quarks, listed in table 1.5. We can also con- 
struct a table listing the basic features of each of these kinds. In table 1.6 
the quantity “color” labels the three distinct kinds of strong charge that a 
quark can have. These colors — called red, green and blue — have nothing 
to do with actual colors we can see — instead the term “color” is shorthand 
for "strong charge." Antiquarks have strong charges antired, antigreen and 
antiblue. Since the leptons do not experience the strong interactions, they 
have no color charge (they are color neutral) and so that entry in table 1.6 is 
blank. 


lt is also possible for a fermion to be its own antiparticle; in this case it is called a Majorana 
fermion instead of a Dirac fermion 
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TABLE 1.5 
'The Kinds of Matter 


LEPTONS particles that do not 
experience strong interactions 


QUARKS particles that do 
experience strong interactions 


) 


TABLE 1.6 
Basic Properties of Quarks and Leptons 
Flavor EM Charge Color Helicity Mass (MeV) 
L electron e -1 L,R 0.511 
S e-neutrino Ve 0 L,? « 2 x 1078 
T muon JL -1 L,R 106 
O p-neutrino vj, 0 L,? < 0.19 
N tau T -1 L,R 1777 
R T-neutrino v, 0 L,? < 18 
Q up u +2/3 R,G,B L,R 2 
U down d -1/3 R,G,B L,R 5 
A charm c +2/3 R,G,B L,R 1200 
R strange s -1/3 R,G,B L,R 100 
K top t +2/3 R,G,B L,R 171,000 
S bottom b -1/3 R,G,B L,R 4,200 


The quantity “flavor” is actually the charge experienced by the weak inter- 
actions. We will see that a strong interaction has the effect of changing a 
quark of one color (red, say) into a quark of another color (blue, say). Like- 
wise, a weak interaction has the effect of changing a particle of one flavor (for 
example an electron) into another flavor (its corresponding neutrino). So in 
this sense flavor is to weak what electric-charge is to electromagnetism and 
what color is to strong (and what mass is to gravity). Note that flavor is 
the property by which we distinguish different types (or species) or particles. 
'This is often the source of some confusion when one first tries to learn about 
the weak force, and so I have deferred the discussion of weak interactions to 
appear in Chapter 20 after the electromagnetic and strong forces in order that 
you can become more comfortable with a number of other concepts first. So 
there are 6 flavors (or types) of leptons, plus each of their antiparticles, and 
6 flavors of quarks, plus each of their antiparticles, for a total of 24 fermion 
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particle types in all**. Including the 14 types of bosons, this brings the list of 
elementary particle types to 38. 

An additional complication in understanding particles has to do with quarks. 
Despite considerable effort over the years, it has not been possible to directly 
observe quarks. What we actually observe are bound states of quarks. It 
is generally believed (but not yet proven!) that quarks can bind together in 
only two possible ways, given in table 1.7. We call a bound state of quarks a 


TABLE 1.7 
Quark Bound States 
BARYONS:  qqq (a 3-quark bound state) 


MESONS: q#q_ (aquark-antiquark bound state) } HADRONS 


hadron. So far all observed hadrons have come in one of two types: baryons 
(which are bound states of three quarks) and mesons (which are bound states 
of a quark with some other antiquark). All hadrons are color-neutral, and 
the proton is the only stable hadron known (a free neutron will decay in less 
than 15 minutes). Understanding this structure will be one of our main tasks 
when we come to the strong interactions beginning in Chapter 15. 

Note that the masses of the particles vary quite widely and (unlike the 
quantities in the rest of table 1.6) appear to have no pattern. It is generally 
believed that the masses of all elementary particles are acquired through some 
symmetry breaking-effect with a field called a Higgs field. The basic idea is 
that the Higgs field causes a kind of “drag” on all elementary particles of 
varying strength, and this drag is what we perceive as inertial mass. A 
consequence of this idea is that there must exist at least one spin-0 boson, 
which is called a Higgs particle. In October 2000 tantalizing evidence was 
presented from LEPII at CERN that the Higgs particle has been observed 
with a mass of approximately 114 GeV. However, subsequent analysis of the 
data has failed to confirm this, and the best that we can say at this point 
in time is that the Higgs mass is not less than 114 GeV. One of the main 
purposes of the LHC is to find the Higgs particle if it indeed exists. 

Finally, note that both the leptons and the quarks come in three groupings 
according to mass: a light group, a medium group, and a heavy group. We 
refer to these groupings as generations (or sometimes as families). The light- 
est generation consists of the up and down quarks, and the electron and its 
neutrino (plus all their antiparticles). As noted above, all matter in the peri- 


**It might be argued that I shouldn't count the antiparticles, since they are just like the 
particles except for a reversal of the signs of all the charges. I have made the distinction 
because a particle and its antiparticle counterpart are not identical — they can easily be 
distinguished in experiment. 
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odic table is made of combinations of these 4 particles (with the up and down 
quarks combining into protons and neutrons). Each generation is identical 
to the others in structure except for the differing masses. At present it is 
not known why this structure exists, and why the pattern has this particular 
form. 


1.3 The Standard Model 


The Standard Model of particle physics refers to the sum total of our knowl- 
edge of all the forces and particles described above. It is a particular kind of 
quantum theory, called a quantum field theory, that has a particular particle 
content and a particular symmetry group. There are infinitely many other 
models of the same general type that one could construct, but with different 
particle content and/or symmetries, as well as different values for the pa- 
rameters. The Standard Model is the one that describes what we actually 
observe. 

'This particularity sounds reasonable — after all, isn't it the job of physics 
to describe what is observed? Furthermore, so far there are no experiments 
known that disagree with the predictions of this model. However, the model 
has a number of parameters that must be input from experiment before any 
further predictions can be made, summarized in table 1.8. 


TABLE 1.8 
Standard Model Parameters 


masses of all fermions 
coupling constants 
mixing angles 

vacuum angle 

Higgs mass and coupling 
Vacuum Energy 


RIN A 00 


We see from table 1.8 that there are a total of 28 parameters in all. I’ve 
included the possibility that neutrinos have nonzero mass, though strictly 
speaking the Standard Model assumes they have zero mass. I've also included 
gravity (which accounts for one of the coupling constants of the four forces) 
and the strength of the dark energy (the vacuum energy) as two of the pa- 
rameters. Together with General Relativity, the Standard Model provides us 
with 38 coupled differential equations for each of the particle types, depen- 
dent on the 28 parameters above. Once these 28 parameters are given we can 
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predict the outcome of any process in particle physics from this model — and 
so far there are no experiments that disagree with any predictions which have 
been made. Not all of these parameters have been accurately measured, and 
their precise measurement is one of the current tasks of experimental particle 
physics. 

'The origin of these parameters and their empirical values is an unexplained 
mystery. It is hoped that a unified theory of physics (if one can ever be 
constructed) will explain both their values and their interrelationships, and 
that the set of 38 equations will actually be derived from one more unified 
theoretical structure, ideally one master equation. At present we don't know 
what that might be, so we must make do with the model that we have if 
we want to explain our observations of the subatomic world. You might 
think that 38 equations is a bit much to deal with. It is, but in practice we 
don't have to deal with them all at once — instead we can work with subsets 
of these equations, depending on the system of interest. Furthermore, the 
different equations have some common (and elegant) features amongst them 
due to the particular symmetries of Lorentz invariance and gauge invariance, 
subjects we will explore in chapters 2 and 12 respectively. 

The purpose of this book is to teach you about the Standard Model: to 
fully acquaint you with its basic structure and features, to show you how 
to compute simple predictions from it, and to inform you of its empirical 
underpinnings. So let's get started! 
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FIGURE 1.4 

A pictorial representation of the different particles in the Standard Model and 
their interactions. A line between different shapes indicates that all particles 
within a given shape interact with all others in the connected shape, except 
for the photon, which does not interact with particles below the dashed lines 
(ie. the neutrinos and the Z). Lines joining a shape to itself indicate self- 
interactions amongst the particles in the shape. 


1.4 Questions 


1. (a) It is possible to understand the strong force between a neutron and 
a proton as being effectively mediated by a boson called the pion, whose 
mass is 139 MeV. Use this to estimate the range of the strong force. 


(b) Similarly the weak force is mediated by the W-boson, whose mass 
is about 80 GeV. What range do you estimate this force has? 


2. Before the neutron was discovered, beta-decay (the emission of an elec- 
tron from a nucleus) seemed to support the idea that a nucleus consisted 
of protons with electrons trapped inside. The atomic number of a nu- 
cleus was given by the excess of protons over electrons. 
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(a) Using this picture, estimate the minimum momentum that a trapped 
electron must have. 


(b) Use the relativistic relation E? = (pc)? + (mc?)? to estimate its 
corresponding energy. 


(c) How does this energy compare to that for an electron emitted in a 
typical beta-decay process? Does beta-decay tend to support or refute 
the trapped-electron model of the nucleus? 


3. An experimentalist wants to probe distances of d < 10-?? cm. How 
much collision energy must the machine be able to produce? How does 
this compare with the maximum energy of the LHC? If the size of the 
machine scales with the energy, how large would this machine have to 
be? 


4. Suppose the electron had a mass 10 times its mass of 511 KeV. What 
particles would you expect to populate the universe? 


5. Suppose that fermions were found to have different masses from their 
antiparticles. How many new parameters would there be in the Standard 
Model? 
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A Review of Special Relativity 


DOI: 10.1201/9781420083002-2 


Particle physics is concerned with understanding the behavior of the natural 
world in the most reductionistic manner possible. This means we need to 
probe interactions over very short timescales and very tiny distances. This 
in turn requires, among other things, an ability to collide particles at very 
high energies, energies that are far larger than the rest mass energies of the 
particles in the collision, where relativistic effects are significant. 

So for particle physics an understanding of special relativity is crucial. Here 
I will review the basics of special relativity [22]. You probably have seen this 
material elsewhere — but what we do here might be in a repackaged notation 
that you may not be familiar with [23]. 


2.1 Basic Review of Relativity 


The basic postulates of special relativity are twofold: 

1) The laws of physics are equally valid in all inertial reference systems, 
where inertial means that Newton’s first law holds. 

2) The speed of light is constant and of the same value in all inertial systems. 


The first of these postulates is familiar from Newtonian mechanics. It 
means that the laws of physics do not depend upon either the location, the 
orientation, or the constant velocity of the reference frame that we use to 
describe motion. 

It is the second postulate that is counterintuitive, forcing a conceptual leap 
in our understanding to less than familiar territory. So let’s explore this 
postulate in more detail. 

Suppose we have 2 inertial systems (frames) S and S’, where S’moves at 
velocity v with respect to S. If we orient the axes of both systems so that 
they are identical at t = t/ = 0 and so that the relative motion is along the 
x/x' direction, then we know that 


x’ = y(x — vt) 

f = 1 
TTE where y = === (2.1) 
gd A — vie 


t= «(t — vx/c?) 
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a transformation known as a boost transformation. 'To invert this transforma- 
tion (i.e. to go from S" to S ), just change U => —U: 


yu (2.2) 


You can easily check that solving eq. (2.1) for x’ and t’ in terms of x and t 
results in eqs. (2.2). 

'These transformations have several important physical consequences. Let's 
look briefly at what they are. 


2.1.1 Relativity of Simultaneity 


An event is defined as something that occurs at a specific place and time, 
along with all necessary preconditions and unavoidable consequences. It is 
something that happens in the real world, such as a supernova, an earthquake, 
or the fall of a raindrop. Usually we think of events taking place in a particular 
chronological order, regardless of who is observing them. The first surprise 
of special relativity is that this is not the case: events occurring at the same 
time in S but at different locations do NOT occur at the same time in S'. 

Let's look at this more carefully. Suppose an observer in frame S records 
two events A and B located at the respective positions rA and xp as taking 
place at the same time: £4 = tg. The above transformations then give 


v 
>t, — tp = (ta — vra /e) — y(tg — vzp/c?) = Va (tp —xza)#0 (2.3) 


and so the observer in 5” would record them as happening at different times. 
As an example consider an event (fig. 2.1) in which lightning strikes both 
ends of a moving train. There is an observer on a flatcar in the middle of 
the train that is moving (Mona), and another observer (Stan) standing on the 
ground beside the train as it moves by. Let F be the event “Lightning strikes 
the front end,” and B be the event “Lightning strikes the back end." 

Stan receives (“sees”) the light emitted from events F and B simultaneously, 
and concludes that: “since I am midway between the marks on the tracks, and 
since the speed of light is a constant, then events F and B are simultaneous.” 
However, Mona receives (“sees”) the light emitted from event F first, then 
from B, and concludes that: “since I am midway between the marks on the 
train, and since the speed of light is a constant, then events F and B are NOT 
simultaneous: F occurred before B.” See fig. 2.2 for a diagram in space and 
time of this event. 

The reason this happens is that light has a finite speed, and so the infor- 
mation conveyed in the light from the bolt in the back has to “catch up” to 
Mona, who is riding in the middle of the train. Similarly Mona is “catching 
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FIGURE 2.1 
Lightning strikes the front and back ends of a moving train. The lightning 
strikes leave marks on the train and on the tracks, and the pulse of light is 
large enough to be detected by both Stan (on the ground) and Mona (on the 
train). Mona sees the front signal before the back signal; Stan sees both at 
the same time. 


up" to the light coming toward her from the bolt at the front of the train. 
Stan, on the other hand, is not moving and so sees the bolts at the same time. 

As we will see, since the second postulate implies that nothing travels faster 
than the speed of light, it will in turn imply that the transmission of all forms 
of information cannot travel faster than light. Unlike a Newtonian universe 
(in which an infinite speed of information transmission is possible in principle), 
finite transmission speeds in a relativistic universe force us to abandon a notion 
of universal simultaneity. 


2.1.2 Length Contraction 


This phenomenon refers to the fact that an object at rest in S" is shortened 
with respect to S. For example a rod of length L at rest in S’ can be positioned 
so that one end is at x’ = 0, the other at x’ = L. Since it is moving in S, we 
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FIGURE 2.2 

Paths of Stan (dotted), Mona and the ends of the train (dash), train tracks 
(solid), and the light pulses (dot-dash) from the lightning bolts as they move 
forward in time. 


must record its length by measuring its ends at the same time (t = 0, say). 
So its length in S is, using (2.1) with t = 0: 


1 
Tone end — "other end — 3 (S né end ^ T ther ad) =L/y<L (2A) 


So moving objects are shortened along the direction of motion. 


Consider observers on a spaceship that have a poster of Einstein on the wall 
of their ship. From their perspective the picture is 4 ft. x 3 ft. However, an 
observer standing on an asteroid looking through the window of the ship as it 
goes by at 77% of the speed of light will see that the poster is only 4 ft. x 1.5 
ft., as shown in fig. 2.3. 


The importance of this phenomenon in particle physics occurs in the con- 
struction of accelerators — as the particles move at high velocities down an 
accelerator tube the effective length of the pipe is shortened in their direction 
of motion, and must be taken into account in the design parameters of the 
machine. 
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What observers on the ship see What observers on the asteroid see 


FIGURE 2.3 


Length contraction demonstrated pictorially. Apic/Hulton archive/Getty Im- 


ages. 


2.1.3 Time Dilation 


Suppose there is a clock ticking off an interval in S’ — a consequence of rela- 
tivity is that it ticks off a longer interval in S. If the clock runs from t = 0 
to t' = T, then using eq. (2.2) above we have 


Atin s = AY + vAz'/c?) = 4At' = 4T» T (2.5) 


mirror 
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FIGURE 2.4 
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What the asteroid observers see on the 
spaceship: the pulse is received, but at a 
different spatial point 


A laser clock for measuring time dilation. 
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where Az’ = 0 because the clock is always at the same location in S'. So 
moving clocks run slow. 

Consider a “clock” that consists of a laser capable of sending out very short 
pulses and a mirror to reflect those pulses back to a detector (fig. 2.4). The 
proper time will be measured by an observer at rest with respect to this 
“clock.” Suppose this clock is on that same spaceship. Then an observer on 
the asteroid can watch as this “clock” goes by him/her at some speed (from 
his/her point of view). This observer will see the light pulse traveling a greater 
distance before it returns to the detector (see fig. 2.4). Since according to 
the postulates of special relativity the speed of light is the same for both 
inertial observers, the observer on the asteroid will claim that the spaceship 
clock is running slowly with respect to asteroid clocks. If the spaceship is 
moving at 7796 of the speed of light, then for every three hours of time on the 
asteroid, the spaceship clocks register that only a little more than two hours 
have passed. 

In particle physics we see this phenomenon manifest in decay rates of un- 
stable particles: they take longer to decay the faster they are moving. 


2.1.4 Velocity Addition 


If a particle has velocity u' in the x’ direction in S", then its velocity in S is 


"E Ar MARIA, (Az. +) ES (2.6) 
At  y(At+vAxr'/c) (1+ 547) 148 


showing that velocities do not simply add. Notice, though that if the speed of 
light is large compared to either of the velocities, then 


/ 1 

e o (e) ato (2.7) 
lex 

and we see that velocities add as they do in the non-relativistic everyday world 

of experience. 

This is one of the more counterintuitive things to understand about relativ- 
ity, but it follows from (and is consistent with) the second postulate. To see 
how, suppose the moving observer in S’ is shining a flashlight pointed in the 
direction of motion. The light will leave with speed c as seen by the moving 
observer. This means that we must set |u'| = u’ = c. The observer in S then 
measures this speed to be 

u= Can = =e (2.8) 


1+% c+ u 


or in other words, the speed of the light is seen to be exactly the same, in 
accord with postulate 2! 
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2.2 4-Vector Notation 


In the examples in section 2.1 the two frames were neatly aligned along the 
direction of motion. Of course this need not be the case; in general they could 
be moving in any direction relative to each other. So it’s useful to introduce 
simplifying notation that unifies time and space components together. 

This anticipated union suggests that vectors will have to be generalized 
from their 3-component form to having 4-components: one for each spatial 
direction and one for time. We call such objects 4-vectors, and can write one 
of them as, say, a^ = (a°, @). The Oth component is the “time” component, 
and the others are the spatial (3-vector) components. 

Let's define 
aod 22 ay qs (2.9) 


to be the components of a 4-vector z^" that denotes the location in space 
and the time elapsed at some event (relative to a fixed choice of origin). 
Multiplication of the time by c ensures that each coordinate has units of 
length. In other words, we measure the time elapsed in terms of the light- 
travel-time: how far light would travel in a given time. Since all observers 
agree on what the speed of light is, the difference in ct as measured by various 
observers is equivalent to the difference in time t. 

Now we can write the particular Lorentz transformation given in eq. (2.1) 
as a matrix equation 


à y —8*00 
g^ = y AP ye” where A", = E o > s (2.10) 
T 0 0 01 


1 
1-82 
nents of the matrix A”,, are zero, except for 


with y — and f = v/c. So for the transformation (2.1) all compo- 


AQ A Sy A =A =-Pr A= A4351 (2.11) 


Before continuing, let’s get a few conventions straight: I will use Greek 
letters to indicate spacetime indices that run over the values 0,1,2,3, and I will 
use Latin letters from the middle of the alphabet (i, j, k, ...) to denote spatial 
indices of the type familiar from vector calculus; these will take on the values 
1,2,3. Pll also use the summation convention: repeated indices are summed 
over (unless otherwise indicated). This means I can replace yoa A" 2” with 
A" 2” in eq. (2.10) since the v index is repeated. Repeated indices almost 
always occur in pairs, so it is always clear what indices to sum over - Pll 
make it clear whenever this is not the case. Note that the repeated index is 
arbitrary, since we are summing over all of its values; in the sum in eq. (2.10), 


30 An Introduction to Particle Physics and the Standard Model 


instead of y we could have used p, c, c, or any Greek letter we wanted! We 
can use the summation convention for any repeated index (Greek or Latin). 
So Greek indices are summed from 0 to 4, and Latin indices from 1 to 3. 

Now by postulate (2), a pulse of light emitted at t = 0 and # = 0 in S 
has the same velocity in S". Hence the distance ds to the wavefront from any 
point Az in S at time At is the same as the distance ds’ measured from any 
point Az’ in S'at time At’. Using Pythagoras’ theorem we have 


(cAt)* — (Axt)? — (Az?) — (Az?) = ds? = ds"? 


= (cAt')? — (Az)? - (Ar?)-(Ar%)? — (212) 


and you can easily check that (2.1) and (2.2) satisfy this relation. 

The quantity is called an invariant; it has the same value in any inertial 
system. Invariants are nothing new — this concept is present in non-relativistic 
Newtonian mechanics. For example the length of a vector is invariant under 
rotations in Newtonian mechanics. What is new in relativity is that invariants 
typically involve a mixture of spatial quantities with temporal quantities. 

The invariant quantity above in eq. (2.12) is a bit cumbersome, and it 
would be nice to write it in the form of a sum. The problem is we have 3 
minus signs. We deal with this by writing 


ds? = gu AIt Aa" = guy Ae PAC" (2.13) 
where g,, is a matrix called the metric 


+1 


aa goo = +1 
Juv = >j i.e. Jij = —Óóij (2.14) 


which generalizes the 3-dimensional Kronecker-delta function used in vector 
calculus as you can see from eq. (2.14). 
Notice that, since z/" = A" , z", we have 


Bard a sep og) (a a2”) = (9, A^ AY s) on (2.15) 


Since this relation has to hold for all possible z^ and x'*, we must have 


Ju N" aA" g = Gas (2.16) 


This “boxed” relation is very important: it must be obeyed by all Lorentz 
transformations A",. Alternatively, any matrix obeying this “boxed” equa- 
tion is a valid Lorentz transformation. We take the boxed relation to mathe- 
matically define a Lorentz transformation. 

While there are infinitely many possible Lorentz transformations, it is pos- 
sible (and useful) to categorize them into six kinds. Since there are three 
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distinct directions (along x,y, and z ) for a moving observer to travel, three 
kinds of Lorentz transformations are transformations from the stationary to 
the moving observer along each of these directions. These are called boosts. 

What are the other three kinds of transformations? Suppose we don't 
make any boosts. This means that A must have the form 


A^ = E 2) (2.17) 


where Rij is some 3 x 3 matrix. It’s easy to show from the boxed relation 
that this matrix must obey RikRjk = 043, which in matrix form is RTR = 
I. This is just the defining relation for a rotation (or rather an orthogonal 
transformation, to put it in mathematical terms), and so we see that rotations 
are a subset of the Lorentz transformations! It’s clear then that there are 3 
kinds of rotations (about x,y and z), and putting these together with the 3 
boosts makes a total of 6 kinds of continuous Lorentz transformations. 

Lorentz transformations are actually the relativistic analog of rotations. 
They transform 4-vectors analogous to the way that rotations transform 3- 
vectors. In this sense they define for us 4-vectors (and 4-tensors — mathe- 
matical objects with more than one Lorentz index) just in the same way that 
rotations are used to define 3-vectors (and 3-tensors — mathematical objects 
with more than one spatial index). We can make the following comparative 
table: 


TABLE 2.1 
Rotations compared to Lorentz transformations 
Rotations Lorentz Transformations 
Riz Fk = big gu, A" AY B = Gop 
$ (Z) = (REE) SCALAR o (a 2 = AA 
Wi) = REV (Re) “VECTOR,  V@@) =4 Ve t) x”) 
T|¡(2) = RER}Ta(R x) TENSOR T'*é(5)=A* AP A) 


2.3 Spacetime Structure 


We can use the metric to define a new vector x, from our event-vector x” and 
vice versa: 


Y, = Juve" and g = gtp (2.18) 
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where g"" is the inverse of g,,. So using this we can write (2.13) as 
ds? = Ax, Ax" = Aq AR (2.19) 


“Raised” indices are called contravariant; “lowered” indices are called covari- 
ant. This distinction — unimportant in the three-dimensional flat spatial world 
of vectorial quantities in Newtonian mechanics — is of crucial importance in 
our description of the four-dimensional relativistic world. A given covariant 
vector A, and its contravariant counterpart A” contain the same informa- 
tion — it is how this information is expressed and manipulated compared to 
other physical quantities that makes the covariant /contravariant distinction 
important. 

Here's an example. Notice that for any 2 four-vectors A,, and B,, we have 


A,B! = gue A" BY = gu, (AY , A?) (AY BP) 
= US UN. o) A? B? = gag A“ B? = Ag B? (2.20) 
ie. AgB? is invariant (it is the same in any inertial frame). It is the four- 


dimensional analog of a dot-product. Sometimes we will write A- B for the 
dot product: 


A- B= AgB? = AyB? — A;B) = AoB? — A. B (2.21) 


where 3-vectors will always have "arrows" on top (or have Latin indices at- 
tached to them) so we know that's what they are*. 
Notice that the “square” of a 4-vector need not be positive: 


A? =A: A= Ag A? = Ao A? — 4. A (2.22) 


This means that our concept of magnitude needs to be extended in special 
relativity. To do this we define: 


A? >0 timelike 
A? =0 null or lightlike 
A? <0 spacelike 


Why this terminology? It’s descriptive of how spacetime (i.e. space and time) 
appear to a given observer. Consider an observer at a position z^ relative to 
the origin. We thus have either z? > 0, 2? = 0, or a? < 0. This splits 
spacetime up into three regions about any given point, as illustrated in fig. 
2.5. A classification is given in table 2.2. 


* will write A; BJ = AJBJ = A;B; = AJ Bj A.B these all mean the same thing for 
3-vectors. Note that the quantity A? BÊ = A? B? + AJ BJ is not invariant — it will look 
different in different Lorentz frames. 
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timelike wordline 


J lightlike DURS 
wordline 
elsewhere 
elsewhere si 
past 
Relativistic Non-relativistic 
viewpoint viewpoint 


FIGURE 2.5 

The diagram on the left shows the causal structure of space time in special 
relativity. The light cone about a point “P” plays a fundamental role in 
determining the events that can and cannot affect (or be affected by) P. The 
non-relativistic version of this diagram is at the right: all events are either to 
the future or past of P unless they are simultaneous with P, i.e. are in the 
“now” of P. 


2.4 Momentum and Energy 


Because z^ is a position 4-vector, it cannot remain fixed — even if the spatial 
components remain constant, the time a? must increase, since the observer 
cannot remain frozen in time. In the most general situation, all the compo- 
nents of z^" will be changing with time. We can parametrize this change by 
making each component a function of some parameter 7. As we shall see, we 
are typically more interested in how these functions change as 7 changes, so 
let's define 

Az" dat 


po 
U im = 
Ar—0 AT dt 


(2.23) 


which is the rate of change of the position 4-vector with respect to this pa- 
rameter. 

What is 7? If we choose it to be the time coordinate, then we would have 
u^ = (352, q) = (c, az). 'This looks like a velocity in the spatial part, but 
leaves us with a problem: which clock is measuring the time? In Newtonian 
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TABLE 2.2 
Past, Present, and Elsewhere 


everything the observer 


2 = Jt>0 PE potentially will influence 
z^ > 0 > Jet] > |z : : 
t<0 P everything that potentially 
ast E 

influenced the observer 

z? «02 |ct| « |z Elsewhere no influence between 
observer and environment 

r? =0 els Lightcone Boundary between elsewhere 


past and future 


mechanics time is absolute, but in Einsteinian mechanics, moving clocks run 
slow, as we saw in section 2.1.3. 


What we want is for 7 to be a time that all observers can agree on. There 
is one such definition of time — namely the time measured by a clock that 
is at rest with respect to the observer whose position 4-vector is z"(T). For 
example, suppose the observer is on a plane. The clocks on the plane (and 
everything else: the flight attendants, the heart rate of the passengers, the 
movie) tick slow relative to you standing on the ground. If your clock ticks 
off an interval At, the plane's time is Ar where 


Ar = At/7 (2.24) 


Even though the clocks on the plane are moving slower, everybody will agree 
on what those clocks say. So both you and observers on the plane (and 
anyone else moving in some other inertial frame) will agree that the clocks on 
the plane tick off an interval Ar. We call Ar the proper time of the observer. 
It is the shortest time any observer can measure, and it is invariant, since all 
observers (moving or not) agree with what it is. 


Note that if we have a collection of N observers located at z^, each moving 
at different speeds, then each will have their own (invariant) proper time Ta 
(A = L..,N). At ordinary speeds y = 1 to about 1 part in 10%, so we 
never notice this effect, and each observer perceives time as absolute. This 
is why the non-relativistic picture at the right of fig. 2.5 is so useful, and so 
much in accord with our everyday intuition. However, in particle accelerators 
y > 1 and these time-dilation effects are crucial. For example the lifetimes of 
unstable subatomic particles can dramatically increase at very high velocities. 


Since 7 is invariant, u" is covariant: 
dz" dar d da” 


n Bony) AB C _ AB ay 
u dr! dr dr (A zu) A v dr A pu (2.25) 
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and the components of u” are 


dE de? dz dz dz 
a ($, z) dr (e x) y (e x) E (e x) Y (e 9) 
(2.26) 


We see that the spatial component of the 4-velocity is related to the usual 3- 
velocity by u’ = yv’ ~ v’, where the approximation is good whenever |v] < c. 
The magnitude of the 4-velocity is 


uu=Y (4-00) =? (1-8-.0/) =P (2.27) 


which is invariant! 

Note that u? > 0 : the 4-velocity of a particle is always timelike. In 
terms of the causal structure in figure 2.5, this means that the trajectory of 
any physical object must remain within its future light cone, and must have 
emerged from its past light cone at any event P. This trajectory is called the 
worldline of the particle, and if the particle has mass the worldline must be 
timelike as shown in figure 2.5. 

Now let's consider momentum. Since this is (mass) x (velocity) in Newtonian 
mechanics, let's define 

p” = mu” (2.28) 
where we identify the constant m with the mass of the body whose 4-velocity 
is u” since 

p = mu! = ymv (~ mv’ for small v?) (2.29) 
This seems like a reasonable definition of the spatial components of the mo- 
mentum. What, then, is p?? We have 


mc 
p? = mu? = 4mc = 


1—v2/c 
1 1 3 y 

= - |m + m+ m> +... (2.30) 
c 2 8 c? 


upon expanding in powers of 2. The 2nd term in the series is easily recognized 
as the non-relativistic kinetic energy, and so we define 


mc? 


V1 - v/e 
Remember that in non-relativistic mechanics it is only changes in energy 


that are physically meaningful. However, in relativity, every body has a min- 
imum constant energy mc? called its rest energy (or rest mass): 


R — me (2.32) 


p)?— E/c where E=ymc*= (2.31) 


Note that since m is a constant it is an invariant: all observers agree on what 
itis. The kinetic energy is the difference between the full energy and the rest 


energy 


vt 


1 3 
- _ 2 2 
T=E=R=(y-1)mc = ¿mu RA (2.33) 
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which is the usual non-relativistic term plus an infinite series of speed-dependent 
corrections. 
What is the magnitude of p"? It's easy to compute it: 


p =p-p=mu-u=m* (2.34) 


and so p? is an invariant. However, we also have p- p = (Joy? — p- p, and so 


E? = (mc?) + |p? > E = y (me)? + e p? (2.35) 


which gives an expression for the energy of a particle in terms of its mass and 
its momentum, analogous to the non-relativistic relation E — r 

The interesting thing about the relation (2.35) is that it holds even for a 
massless object! Setting m = 0 in eq. (2.35) we find 


E-|p|c and p?=0 (2.36) 


which means that a massless particle has a 4-momentum of zero magnitude! 
We can't make sense out of this from our earlier expressions E = — + 


V 1-v?/c2 


7 —. because both would give zero — unless the massless body 


y/1=v2/02 
travels at the speed of light (i.e. |v] = c). In this case we have a zero-over-zero 
limit, and the preceding expressions become ambiguous. 

Rather than work with such ambiguity, we regard equation (2.34) as being 
the equation that fundamentally defines the 4-momentum of a particle of rest 
mass m. This equation is valid for all m > 0, and yields equation (2.35) for 
m #0, and equation (2.36) for m = 0. 


and p — 


2.5 Collisions 


Energy and momentum are conserved in any process (as we'll see in Chapter 
4), which is why they are useful quantities to deal with. In fact our picture of 
collisions in relativity is quite similar to our non-relativistic picture! Each has 
both elastic and inelastic collisions, and energy and momentum are always 
conserved. The key difference between the non-relativistic and relativistic 
cases has to do with the conservation of mass, something not true in relativistic 
physics. Table 2.3 illustrates the parallels between the two situations. 

In solving collision problems, it is generally a matter of ensuring that the 
relation 37 Pin =% Phut is satisfied. In principle this can be done component- 
by-component. However, in practice such an approach is rather cumbersome. 
A better strategy is to search for invariants, exploiting relationships between 
them to simplify the situation and solve the problem. This is typically done 
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TABLE 2.3 
Collisions 
Collisions 
Non-relativistic Relativistic 


Momentum is conserved 


25 Pin = 25 Pout 


4 — momentum is conserved 


Energy is conserved Pia = Pout > { Xn E e Bout 
Nin = YE Eg in out 
Mass is conserved Mass is NOT necessarily conserved 
2 Min = Mont 2Min FL Mout 

Kinetic energy may Kinetic energy may 

or may not be conserved or may not be conserved 


by isolating the quantities of interest in the problem, and then taking dot- 
products and squares to obtain the desired answer. 


Let’s look at some examples of how to use this formalism. 


2.5.1 Broadside Collision 


Two particles each of mass m and speed v, collide at right angles, forming a 
new body of mass M. What is the mass of M? 


Answer 
The best way to solve this problem is to recognize that the squares of the 
4-momenta of both the incoming and outgoing particles are invariants. 


First conserve momentum: pi + pf = p" 


Square both sides : p? + p? + 2p, p, = p? 


Each of the squared momenta is the rest mass of its corresponding particle, 

and so 

E¡E, 
c 


(mc)? + (mc)? +2 ( Ba: i) = (Mc)? (2.37) 


Now we use the fact that the spatial momenta are orthogonal ( pi.Lp; ) and 
that the energies are related to the rest masses via EA = ya mc?, where 
A=1,2 labels which particle we are interested in. Hence 


2m? (14 y y.) = M? 


1 2—wv?/c 
272 9472 E 
> M*^-—2m (1+ 72) im c (2.38) 
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FIGURE 2.6 
A Broadside collision between two equal mass particles moving at right-angles 
to each other. 


Note that M > 2m. This is a sticky collision and so the mass increases 
because kinetic energy is converted into rest energy. 


2.5.2 Compton Scattering 


A massless particle elastically collides with a massive one (mass M) at rest. 
What is the final energy of the massless particle if it is scattered at an angle 
0? 


FIGURE 2.7 
Compton Scattering of a massless particle from a massive one 


Answer 


Setting c = 1 for simplicity, as before we conserve momentum and find in- 
variants. The most complicated momentum is the momentum of the massive 
scattered particle, since we don’t know the angle it scatters to, nor do we 
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know its final speed. So we isolate p'" and then square both sides: 


First conserve momentum: p” = p" + k” — k” 
Isolate the most complicated 
momenta and square: p° = (p+ k — Kk)? 
Simplify : M? = M? + k? + k’ 
42(p-(k— Kk) — k-k’) 
Rearrange (remember k? 20 = k2) : p-k=Kk'-(k+p) 


Now we need to insert the given information in the problem, namely that 
k" — E (1,4), where Å is a unit vector that points along the direction of 
motion of the initial massless particle. We know that this is the form k^ 
takes because it is null: we must have k"k, — 0. Similarly k'* = E' Er i), 
where we know from the setup of the problem that k- b! = cos0. Hence we 
have k’ -k = EE" — EE'k- k' = EE'(1— cos0). We also know the initial 
4-momentum of the massive particle because it is at rest: p" — (M , 0). 
Putting this into the above gives 


Write in terms of components : ME = E' (E + M) - E'E (i i’) 


= EE'(1— cos0) + ME’ 
ME 
E(1-— cos0) + M 
E 


= 2.39 
1+ 4(1 — cos) on 


Solve for the final energy : F’ = 


Note that the only variable in the problem is the scattering angle 0: the 
energy E’ of the scattered massless particle is fully determined by this angle 
and all other given variables. 


2.5.3 3-Body Decay 


A particle of mass M explodes into 3 identical particles each of mass m, which 
move away from each other at equal angles. What speed(s) do the particles 
move at? 

Answer 


Again, we conserve momenta. The relation is 


p" = pi + ph etr (2.40) 
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FIGURE 2.8 
Diagram for the three-body explosion 


Here we have to make clever use of the fact that all 3 particles come out at 
the same angle. This means that 


m 


BÉ DB; = |B-.|P;|cos0 for every i,j = 1,2,3 (2.41) 


In this example it is more useful to look at the components of the conservation 
law. First let’s check the spatial components 


Spatial momentum conservation => pı = —(p2 + pa) 
= | Pil’ = -pı (P2 + P3) =c080| Pil (Go| + [p3l) 


Doing this for all three particles, we find 


ou Ial | lsd 


[Pol +l] — lps| + lei) [2| + [i 


and so we see that |p1| = |po| = |ps]. 


Now let's use energy conservation. Since p” = (m C, 0), we have 


E, Es Ej E 
pc Sg x ai 


c c c c VA - v/e 


where each energy must be equal since the momenta and rest masses of the 
final particles are all equal. Hence 


v—cy/1—3(m/My (2.42) 


Energy conservation : Mc= 
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2.6 Questions 


1. 


Show that the product Ag B? is Lorentz-invariant but the products 
A? B? and AgBg are not. 


. Suppose we write the velocity v of a particle as v = ctanh y, where y is 


a parameter called the rapidity. 


(a) Find the form of a Lorentz transformation for a particle moving in 
the x direction in terms of this rapidity parameter. 


(b) Consider a succession of two boosts, both in the x direction, with 
velocities vı and vg. What is the value of the rapidity parameter for the 
combination of these transformations in terms of the rapidity parameters 
nı and na for each? 


A particle moving at speed v collides with an identical particle at rest. 
What is the center-of-mass frame speed of this particle? 


The lifetime of the muon is 2.2 x 107% seconds. Muons are produced 
high in the atmosphere (10,000 meters above the surface of the earth) 
from pions in cosmic rays moving at 99.9% of the speed of light. Once 
produced the muons move at the same very high speed. 


(a) How far will the muon travel according to non-relativistic physics? 
Will it make it to the surface of the earth? 


(b) How far will the muon travel according to relativistic physics? Will 
it make it to the surface of the earth? 


(c) The pion lifetime is 2.6 x 1078 seconds. Is it possible for the pions 
to reach the surface of the earth? 


p-E 


FIGURE 2.9 
Diagram for question #5 
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. (a) An 7 moving at 3/4 the speed of light decays into two photons of 


equal energy E, which are each emitted at an angle 0 relative to the 
direction of the 7's motion. Find 0. 


(b) Consider the same process, but now with the photons emitted along 
the direction of motion. What is the frequency difference between the 
two photons? 


. It is possible to define the force on a particle in special relativity as the 


proper-time derivative of its spatial momentum 
pides 

dt 
where p” = mu". 
(a) Show that the acceleration a^ = de of a particle is always orthog- 
onal to its 4-velocity u”. 
(b) Express the relationship between the spatial acceleration d = gg and 
the spatial force F. Do they always have the same direction? 


(c) What is the meaning of the Oth component F°? 


. Neutral pions can be produced from the collision of a photon with a 


proton p + y — p4 «9. 


(a) What is the minimum energy a photon must have for this process 
to take place when the proton is at rest? 


(b) The largest energy a photon in the cosmic microwave background 
can have is about 1 meV. What is the minimum energy that a cosmic 
ray proton must have in order to produce pions by scattering off of the 
microwave background? 


. A pion traveling at speed v decays into an lepton of mass m; and its 


correspond antineutrino De. Suppose the antineutrino is emitted at 
right angles to the direction of motion of the pion. 


(a) Find an expression for the angle that the lepton is emitted relative 
to the orginal direction of motion. 


(b) Suppose a pion of speed v emits a muon at angle 0, and another pion 
at speed v' emits an electron at the same angle, each having emitted 
the antineutrinos at right angles to the direction of motion. How much 
larger or smaller is v compared to v’? 


. Particle A decays into two particles: A — B + C. 


(a) Find an expression for the energy of each outgoing particle in terms 
of the various rest masses. 


(b) Find the magnitudes of the outgoing momenta of B and C. 


(c) Under what circumstances can your answer in part (b) equal zero? 
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10. 


11. 


12. 


13. 


14. 


Find the energies of the outgoing particles for the following decays 
mt > e7 +p. 
K- >n +n’ 
Wo — 2y 
A> p+n 
p nt +r 


Particle A collides into particle B, which is at rest, and three or more 
particles are produced as a result: A+ B — Ci +C2 t +C. 


Find the threshold energy for this reaction to take place in terms of the 
rest masses of the particles. 


Find the threshold energies for the following reactions 
ptp>p+p+r’ 
tm +p—> K°4+ 09 
p+p=>p+K%4Y* 
PEDIDA EA 
ptp>pt+tp+p+p 


Particle A decays into three or more particles: A B+C+D+:-:: 


(a) Find the maximum and minimum energies that particle B can have 
in terms of the rest masses of the other particles in the problem. 


(b) For the decay D? — K~ +77 +e* + ve, find the maximum and 
minimum energies the e* can have. 


Consider a two-body scattering event A+ B — C+D. Define the 
following quantities 


2 2 2 
s= (pat pp) t = (pa — pc) u = (pA — pp) 


which are a set of variables called Mandelstam variables. 


(a) Show that the sum of the Mandelstam variables is a Lorentz invari- 
ant quantity and compute its value. 


(b) Find the energy of A in the lab frame, where particle B is at rest in 
terms of the Mandelstam variables and the rest masses of the particles. 


(c) Find the center-of-mass energy of A in terms of the Mandelstam 
variables and the rest masses of the particles. What is the total energy 
in this frame? 
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Symmetries 
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One of the most fundamental notions of physics is that of symmetry: the 
idea that certain systems, or properties of a system, or laws governing a 
system, remain unchanged when you do something to them. For example, the 
gravitational field of a homogeneous spherical object is rotationally symmetric: 
it is the same no matter how you orient yourself about the sphere. Another 
example is a crystal: an arrangement of atoms that looks the same if you shift 
it in space in a certain direction. You can learn a lot about a physical system 
(and the mathematics that describes it) just by knowing what symmetries 
it has. In physical theories symmetries manifest themselves in terms of two 
basic notions: invariance and covariance. 

Invariance is the term used to describe properties of a system that do not 
change when a symmetry transformation is performed. Ifa quantity is invari- 
ant, all observers will agree on its value. We’ve already seen that the proper 
time of an observer is an invariant under the Lorentz transformations: all 
observers agree on what the proper time is for any given observer. Rest mass 
is another such invariant. 

Covariance is the term used to describe properties of a system that change 
in accord with the changes induced by the symmetry transformation. For 
example, if the equations of motion describing a given system have well-defined 
transformation properties when a given symmetry operation is performed, 
we say that these equations are covariant (i.e. they vary along, or co-vary, 
with the transformation). In the above example, the equations describing 
the motion of a body around a homogeneous spherical object are covariant 
with respect to rotations — they will transform in a manner consistent with 
rotational symmetry. 

We'll see later that symmetries necessarily result in conservation laws. 
'These laws may be used to obtain new information about a given system, 
which in turn may yield further laws. In general a given interaction respects 
many conservation laws: conservation of momentum, angular momentum, 
charge etc. This stringently constrains the possible mathematical description 
of the interaction. 

Symmetries can be either continuous or discrete. For example, time-reversal 
is a discrete symmetry (all clocks either advance or retreat), whereas time- 
translation is a continuous one (the zero of time is a completely arbitrary 
choice). Discrete symmetries result in multiplicative conservation laws, where- 
as continuous symmetries yield additive laws. 
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3.1 Groups 


Mathematically, symmetries are properly described by making use of two 
concepts: that of a group and that of an algebra. Let's look at both of these, 
with some examples [24]. 


3.1.1 Axioms of a Group 


When we think of a symmetry, we think of performing certain operations, 
or transformations, on a system that leave some of its basic properties un- 
changed. We can collect all of the relevant transformations into a set, 
G = (91,92...) (which may be infinite in size), and then ask what basic 
properties the set should have in order for it to describe a symmetry. Math- 
ematically the answer is given in terms of what we call a group. 

A group is a set of G = (91, 92, ...] of objects with a binary operation “o” — 
some operation that allows us to combine two elements in the set — that has 
four properties: 


TABLE 3.1 
Properties of a Group 
CLOSURE If 91,9 € G => 91992 EG (combinations 
remain in the set) 
IDENTITY There exists I € G (one element 
=> I ogi = gi for every gi € G does nothing) 
INVERSE Every g; € G hasa g' EG (combinations 
such that gı og: =I can be undone) 
ASSOCIATIVITY If 91, 92,93 € G (combinational 


= (91 © 92) © 93 = gı © (920 g3) groupings can 
be interchanged) 


These properties are (in most cases) the minimum ones needed for trans- 
formations to be meaningful in physics, and for the most part are motivated 
by common-sense considerations. If the elements g; are transformations, then 
we'd like a combination of two transformations to be the same kind of trans- 
formation (closure). We also want to undo transformations (inverse), and not 
do transformations (identity). Finally, we'd like our answer to be indepen- 
dent of how we combine any three transformations together, provided we don't 
change their sequential order (associativity). Note the order of applying the 
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“o” is relevant: in general gı o g2 4 go © gı for any two elements”. 

How do we know if something is a group or not? The only sure way is to 
check all four properties. Let's look at the integers as an example, shown in 
table 3.2. A simple check shows that the integers form a group under addition, 


TABLE 3.2 
Integers as a Group 
Integers: G = Z 
Addition (e = +) Multiplication (o = x) 


Closure Li+Z2€Z "4 Zx% EZ "4 
Inverse Zi + (-Z;) =0 Y Zi x 1/(Zi) 21 


but 1/(Z;))¢Z x 


Associativity (Zi + Z2) + Za (Zi X Z2) x Za 
=2Z,4+(Z24+2Z3) Y =Z1ix(Z2xZ3) Vv 


but not under multiplication. However, in this latter case, if we modify the 
set of integers Z to include all fractions and exclude 0, we get a group Q: the 
non-zero rational numbers under multiplication. 

This latter example is a common feature: if we have a set of elements that 
don’t form a group under some combining operation, we can sometimes get 
a group by either modifying the set, or modifying the combining operation 
(or both). It is quite common in physics to have a set of transformations 
(typically implied by experiment) that almost satisfies the properties of a 
group, but not quite. Often it is obvious how to generalize the set so that 
the group properties are satisfied, although there is no general prescription 
as to how to do this. Part of the job of a theoretical particle physicist is to 
make intelligent guesses as to how to find such generalizations to ensure that 
a given system has a desired symmetry. This symmetry can then be exploited 
to help make further predictions about the system that can then be tested 
against experiment. 


Table 3.3 provides an example of another kind of group. This group is 
called U(1), the Unitary group of complex 1 x 1 matrices. Unlike the group 


*If gı © g2 = g2 © gı then we say that the group is abelian. Transformations that shift 
location in space (translations) yield the same result no matter what order they are applied 
in, and so they are abelian. Rotations in space about a fixed point are not abelian — a 
rotation about the x-axis followed by one about the y axis will yield a different result than 
performing these operations in the opposite order. 
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TABLE 3.3 
Complex Phases as a Group 


Complex Phases (G = U(1) = {e* } , o = x =multiplication; 0 € R) 


Closure et x ei?» € U(1) Y 

Identity el x e = gh Y 

Inverse eti x ei ¿01 "4 

Associativit et x gi02) x ed = gh x (601 x E293 "4 
y 


of the integers under addition, which has countably infinitely many elements, 
this group has an uncountably infinite number of elements since the elements 
depend continuously on a real parameter 0. As we'll see in Chapter 12, this is 
the symmetry group of the electromagnetic interactions. 


3.1.2 Representations 


So far we've discussed properties and examples of sets of transformations. 
But how are groups of transformations used in particle physics? What is it 
that the transformations act on? 

In particle physics the most common thing that is transformed under a sym- 
metry operation is the wavefunction of a particle or set of particles. In general 
the wavefunction can be written down as a multicomponent single column 
matrix with complex entries (a complex column vector) in some (abstract) 
multi-dimensional space’. The symmetries of the system are transformations 
that act on the wavefunction via multiplication by complex matrices. 

In (almost) all cases in particle physics, the groups we are interested in are 
sets of complex matrices, and the combining operation is matrix multiplica- 
tion. So what we need is a way of understanding group elements as matrices. 
'This leads us to the concept of representations. 

If we have a symmetry group, how do we write down the elements of the 
group in terms of matrices? Or alternatively, if we have a set of matrices, 
how do we know that these matrices correspond to the group elements of the 
symmetry that we are interested in? 

It is a result of the mathematics of group theory [25] that every group G 
can be represented by a set of matrices: in other words 


For every g € G there is a corresponding matrix M, such that 
n° 92 = 93 > My, My, = Mg, 


which means that the matrix multiplication table of our set of matrices must 
correspond to the multiplication table of the group. A set of matrices {M,, } 


In introductory quantum mechanics, the wavefunction has only a single component, i.e., 
it is one complex function. In particle physics it will commonly be necessary to generalize 
it to multiple components. 
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that has this property is called a representation? of the group G. 

The main advantage of using a matrix representation of a symmetry is 
that you can carry out explicit calculations. Since matrix multiplication is 
associative, the associativity property is automatically satisfied for any rep- 
resentation. Furthermore, the inverse property demands that all matrices in 
a representation must be invertible, which significantly reduces the possible 
forms the matrix representation of a group can have. Finally, the identity 
group element is easily represented by the identity matrix. 

'There are several somewhat counterintuitive points about representations 
that are worth noting: 


e hepresentations are not unique. Since the only constraint on a set of 
matrices to represent a group is that it reproduce the multiplication 
table as described above, it is clear that a group can have many different 
matrix representations, since there are many ways of doing this. In fact, 
a given group G has infinitely many representations! 


e Representations in general are not faithful for the same reason: typically, 
more than one element of a given group is represented by one matrix. 
'The most extreme case of this is to represent all elements by the identity 
(the n x n unit matrix)?. If the representation is 1-1 (i.e. if every group 
element is represented by one and only one matrix — and vice versa) 
then we say the representation is faithfull. A faithful representation 
provides the maximum amount of information that we can have about 
the group. 


e A group of matrices is already a faithful representation of itself: this 
is called the fundamental representation. In general it is possible to 
use other matrices to form different representations of the same matrix 
group. For example the group of 2 x 2 unitary matrices (called SU(2)) 
can be represented by matrices of every possible dimension: 1 x 1,2 x 2, 
3x3,4x4A, .. 


Strictly speaking, a d-dimensional representation is a map p (called a homomorphism) 
from the group to a set of dzd matrices M = (Mg: 


p: —5 M 


$Note that this representation causes us to lose all information about the multiplication 
properties of the group. 

So why not only use faithful representations? Why would we be interested in any rep- 
resentation that was less than faithful? One reason is that we often don't need all of the 
information about the group. A good example of this is to represent every group element 
by either +(Identity) or —(Identity) — this will tell us about the basic even/odd properties 
of the group, and hence something about the basic physics of the system. For example, if 
the group is the group of rotations, then this kind of representation tells us whether or not 
we have a reflection, i.e. a transformation of odd parity. 
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3.1.3 Irreducible Representations 


How do we know what all the representations are for a particular group? This 
is a major problem in group theory, one complicated by the fact that you can 
combine old representations together to form new ones: 


Suppose My, My, = My, and Ng, Ns, = Ns, 


Mg; is an m x m representation 
where d : G 
Ng, is an n X n representation 


Then L, = Mg, 0 is also a representation: Lg, Lgo = L 
Gi 0 No; 91-92 93 


and is an (m + n) x (m + n) representation 


This shows why a group has infinitely many representations: we can always 
keep enlarging the matrices from smaller representations in this block-diagonal 
way. 

But this is redundant! Clearly we shouldn't count the set ( Lj, ) as a separate 
representation, because it is a trivial combination of smaller, block-diagonal 
representations. What we really want to know are the non-redundant sets 
of matrices that can represent a group. We call such sets ?rreducible rep- 
resentations. Specifically, any representation whose matrices simultaneously 
cannot all be decomposed into block-diagonal form is called an irreducible 
representation!. Once the irreducible representations (irreps) are known for 
a given group, then all possible representations are known, because they can 
be constructed out of the irreps. 

So in particle physics (and other kinds of physics), if we have a system S 
that is believed to have some symmetry S, we conceptualize our treatment of 
the system as follows: 


1. Find the group G associated with the symmetry S. 
2. Find all the irreps of the group G. 


3. The wavefunctions that transform under these different irreps are the 
only wavefunctions that are mathematically permitted — and hence phys- 
ically realizable** — to describe the physics of the system S. 


For example, the group SU(3) is the symmetry group under which the 
color charges of quarks transform (we'l see why later). This group has 
representations of dimension 3,6,8,10,.... Hence quark wavefunctions must 
be complex column vectors with either 3,6,8, 10, etc. entries. These are the 
only kinds of quark wavefunctions that can exist if SU(3) is the symmetry — 


Il Mathematically we say that irreducible representations have no invariant subspaces. 
** These wavefunctions are commonly called multiplets, where the “multi” part refers to how 
many components the wavefunction has. A doublet has 2 components, a triplet 3, etc. 
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the math won't allow anything else! Likewise, bound states of quarks must 
have wavefunctions that are one or more of these same dimensionalities. 

How do we find (step 2) the different irreps of a given group G? This is 
a major task for group theory [26], one beyond the scope of what I'm able 
to present here. However, the problem has been solved (and a well-defined 
procedure exists) for a certain class of groups that are of interest in particle 
physics: Lie Groups! 


3.1.4 Multiplication Tables 


You are no doubt familiar with the multiplication table 
x|1/2/|3 


1 (11213 
2 /2)4/6 
3 (3/619 


where I’ve only written down the first few entries for all the natural (counting) 
numbers. The table tells us to combine two numbers under multiplication to 
get another number: by choosing a number z in the leftmost column and 
another number y topmost row, we can find out what the answer is when we 
multiply these two numbers together simply by looking at where the row that 
x is in intersects the column that y is in. In a similar manner, a multiplication 
table provides a straightforward way of writing down all of the information 
about a given group. 


For groups of very large or infinite size this is generally not very practical. 
But for groups of a reasonable finite size it can provide a handy way of letting 
us know what kind of information the group contains, and what kinds of 
relationships exist between its various elements. 


One of the interesting things to consider in a finite group is what happens 
when we take a given element and keep combining it with itself. Consider the 
sequence of elements obtained by taking successive “powers” of some element 
x € G of order ni x, a?-— roc, a? = xo2x02,... We can show that 
eventually this sequence repeats itself in a cyclic manner. This is not too 
hard to see. Suppose that the first repeated element in the group is zP and 
that it is repeated after q + 1 steps. Therefore z?^! = g? for p < qd 1. 
Consequently 


—1 


zl ortl = ag! 


on? > r! =! > p=1 


because we can't have z?-! appear in the sequence before x? unless p = 1. 
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3.2 Lie Groups 


A Lie Group is a group in which the group elements are smooth (continuous 
and differentiable) functions of some finite set of parameters 0, € R, and in 
which the “o” operation depends smoothly on those parameters. A typical 
element g € G is written as 


g = g(01,...,0N) = exp liða T^] = exp [id T| (3.1) 


where a = 1,..., N and {0a} are continuous parameters 


The T*'s are mathematical objects called the generators of the group. There 
are N of them - they are called generators because if you know what they are, 
you can construct any element of the group that you want by exponentiating 
as above — in other words, they generate the group. So the T°’s contain all 
the relevant information!! of the group in a very economical form! 

Lie groups are important in physics because most of the symmetries we 
consider are continuous symmetries. For example, rotations depend continu- 
ously on parameters called angles. A given angle specifies how much rotation 
has been carried out around a given axis. 

Almost all of the important symmetry groups in particle physics are Lie 
groups [27]. The most important of these are the UNITARY and ORTHOG- 
ONAL groups. In particular, SO(3) is the group that describes rotational 
symmetry, a symmetry we believe to be true of all the laws of nature?*. It 
is almost identical in structure to another group, SU(2), which is the most 
important internal symmetry group. In general, in particle physics, we refer 
to symmetries as being either spacetime symmetries (they transform space 
and time co-ordinates in some way) or internal symmetries (they transform 
wavefunction components and/or charges amongst themselves). 

'The action of any group on a wavefunction multiplet will result in a re- 
arrangement of multiplet components. If the group is a symmetry, then we 
mean that the physics of the system is insensitive to this rearrangement. 

Lie groups have been completely classified in terms of their transformation 
properties. Table 3.4 lists all the kinds and their uses in particle physics. 


tf Actually this is not quite true. The T®’s contain all local information about the group — 
in other words, if you know all N of the T*’s then you can construct any group element that 
is continously connected to the identity element. However, other group elements — those not 
continuously connected to the identity — cannot be constructed in this way. This property 
is relevant in constructing the vacuum state in quantum electrodynamics as discussed in 
Chapter 25. 

HH Of course this assertion must be tested by experiment, which so far has provided no 
indication that rotational symmetry is not a symmetry of the laws of physics at their most 
fundamental level. 
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3.3 Algebras 


Closely related to the notion of a group is that of an algebra. An algebra is a 
vector space V = (vr) with a binary (combining) operation “o” such that 


vl ov gu. K — Oly (3.2) 


where the summation convention was used in the last step. This rule ensures 
that when vectors in V are combined, they always give a vector in V, which can 
be expressed as a linear combination of the other vectors in V ; the C17 s 
are the coefficients of this combination, known as the structure constants. 
Note that all we care about here is closure — there doesn't have to be either 
an inverse or an identity! The relation (3.2) implies that knowledge of all 
possible C77 x's is equivalent to knowledge of the combining operation “o” — 
in this sense the C77 s characterize the algebra under consideration. The 
I, J in the C#7 x are not raised using the metric, but rather are placed there 
for ease of notation. 

A familiar example of an algebra is the vector cross-product. The set of 
elements in the algebra is the set of vectors in 3-dimensional space, and the 
combining operation o = x, the cross-product. Any two vectors d and b 
combine under the cross-product to give a third vector d x b, which is also a 
vector in the 3-dimensional space. In terms of the unit vectors {%, 9,2} of a 
Cartesian coordinate system we have 


EXY=Z GX = 2xt=4 
with all other cross-products zero. In this example C ,. turns out to be the 
Levi-Civita symbol e!7%, which shall be defined below. 


3.3.1 Lie Algebras 


Why is an algebra important? The reason is that the Taylor-series expansion 
of a Lie group G gives us an algebra called a Lie Algebra (denoted 6). The 
demonstration of this is given in the appendix. The elements of 6 are the 
generators T" of the Lie group, and the combining operation o = [,] is the 
commutator 

[pe ppp (3.3) 


where the CZ/'s are denoted by the f**'s. The group associativity law implies 
the relation 


[[T°, T"], T°] + [ERE T°] , T°] + TEES T, T"] = 0 (3.4) 
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which is called the Jacobi identity. Any algebra for which (3.3) and (3.4) hold 
is a Lie Algebra. 

As noted previously, we can write the group elements of any Lie Group in 
the form g (61,...,0w) = exp (iĝa T*) . Consequently, if we know what all of 
the T's are, we can produce any element of the group that we want. As 
stated above, we say that a Lie group is generated by the T°’s, which are called 
the generators of the group. Hence we can study (most of) the symmetries 
of physics simply by studying their infinitesimal structure using Lie Algebras 
[28], instead of the more complicated structure of Lie Groups. 


3.4 The Rotation Group SO(3) 


A rotation about, say, the Z-axis is given by: 


xv’ =c0s0 xz + sin0 y x = r+0y=zx-— (02 x T) ĉ 


y' = —sinĝ x + cosüy Small 0 y’ = 944 y= y — (02 x 3) - f 9) 


( More generally 1 = z— (0 x 7) where f= 02 ) 


Do the set of all possible rotations form a group? It's clear that any two 
rotations yields a rotation (closure), that rotation by an angle 0 — 0 is the 
identity and that every rotation can be undone simply by performing the 
same rotation by the negative of the angle. So if we use matrices to represent 
the rotations, we'll have a group (since associativity will automatically hold). 
Moreover, we know that any rotation can be constructed by taking products 
of matrices from the set 


1 0 0 cos 0,, 0 — sin By cos, sinó, 0 
R= 0 cos@, sind, |, 0 1 0 , | —sin 0, cos 6, 0 
0 — sin 6, cos 6, sin 0, 0 cos Ay 0 0 1 


and so our group G = R is the set of all matrices that are products of ma- 
trices in the above set, where (0,,0,,0.) are three arbitrary parameters. A 
general group element t € R will be a function of these parameters, i.e. 
t = t(0,,0,,0.). This group is the orthogonal group of 3 x 3 matrices of 
determinant one, so it is SO(3). 

Let's construct the Lie Algebra of SO(3). We can do this by examining the 
elements of R, rewriting them in the form exp lio T!] c I -- iQ T! for 0, < 1. 
For example 


cos, sinó, 0 1 6,0 
—sin6, cos, 0 | ~ | -0, 1 0 | +0 (62) 
0 0 1 0 01 
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100 0 10 
={010]+86, | -100 
001 0 00 
=1+i0,T* (3.7) 
0 10 
which means that T^ = —i | —100 |. This is the generator of rotations 
0 00 


about the Z-axis. Performing a similar exercise on the other two matrices, we 
obtain the set of generators 


000 00-1 0 10 
{T*,TY,T7}=2-i[001],-2] 00 0 J,-i[| -100 (3.8) 
0—10 10 0 0 00 


which form the basis of the Lie algebra of SO(3), which we call so(3). We 
can check the commutation relations 


0 0 0X /00-1 00-1\ /0 0 0 
rT4]-(-)?loo1i][ooo | -(-i)?| 00 0 001 
0—10/ \10 0 10 0 0-10 
0 10 
= | -100 | = iT? (3.9) 
0 00 


with the rest being similar: [TY, T^] = iT” and [T^, T^] = iT”. So altogether 
we have 
[pep pue queer du enge | (3.10) 


which are the commutation relations of the rotation group familiar from quan- 
tum mechanics (e.g. in solving for the wavefunction of the Hydrogen atom). 

The object c?^is called the Levi-Civita symbol or (more commonly) the 
epsilon tensor. We can write it in several equivalent ways: e“? 


_ ¿cab =€ 
E = Écab - 
It has the following properties 


0 if any of c, a, bare equal 
cach — petab __¿cba switches sign under interchange 
gabe — E E of any pair of indices (3.11) 
= petta — —¢bac 
1 ifa=1,6=2,c=3 


The epsilon tensor plays a role in the cross-product analogous to the role that 
the Kronecker-delta symbol plays in dot product. We have 


A. B = 6% Aj p* (4x BJ = tik Aj Bk (3.12) 
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or more explicitly 
A-B=A'B! + A2 B? + A3B3 
"AE TEM 
(4 x B) = 2g — A? B? (4 x B) = AB! — A'B? 


(4x By = ALB? — A2B1 


Just in the same way that 5’ can be defined in n-dimensions (by letting 
the indices 7,7 take on all values from 1 to n), so also can the epsilon tensor 
be defined in n-dimensions, with the key distinction being that it has as many 
indices as there are dimensions. So in 2 dimensions, the epsilon tensor has 
two indices (etf), in 3 dimensions three indices (&'/*), in 4 dimensions four 
indices (e*"%P) and so on. We will be making use of all of these dimensional 
versions of the epsilon tensor in this text***. 

The algebra so(3) (and its Lie group SO(3)) is so commonly used in physics, 
we give its generators a special notation: T? = J*, so that the set of generators 
{J*, J”, J7} obeys 

jo TT ieee a (3.13) 
and using (3.11) it’s not hard to show that 
eP goce By cbe ¿dar fe Ela cdbe —0 (3.14) 
and so from (3.13) we have 
[[35,3*] ,39 + [5,37] ,3%]  [[35,3*],3*] 20 (3.15) 


which means the Jacobi identity (3.4) holds. So so(3) is indeed a Lie algebra. 
As noted above, each group element of SO(3) can be written in the form 


j 


exp [id . J], which is the matrix that corresponds to a rotation of angle 


about the 6 direction. For example. if = OY: 


> > c (1934)" 
exp E : Jd = exp [i6J"] = 5 Cn 
n=0 ` 
100 oo nign /100 
—-[010]4 c g 000 (3.16) 
001/ xu €u 001 


(2n + 1)! 10 0 


*** There is an important distinction in 4 dimensions compared to 3 and 2, having to do with 
the raising and lowering of the indices. In 3 spatial dimensions we have «2° = e*^* = e, . 
However, it is NOT true that e""eP = e, vaP — ¿1,9 etc. Instead we must raise/lower 


indices of e*Y%B using the metric Juv: 
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0 oo (-1)"(0)"*! 
Ip CUT 0- Yo ee 
0 1 0 


xou uc ee e UE 
cos 0 0 — sin 0 

= 0 1 0 (3.17) 
sin 0 cos0 


which is a rotation of angle 0 about the j—axis. From this we see that J” 
generates rotations about the y-axis. Using similar methods you can show 
that J” generates rotations about the x-axis and J? generates rotations about 
the z-axis. 

'The above formalism means that we can rewrite a general element in the 
set of rotational transformations (3.6) in the form 


AN 7 
dum (exo [ia - jj) Ake (3.18) 
which for small angles |a;| << 1 gives 
qc (6,7 — ean) £j = £i — (Q x 2), (3.19) 


of which eqs (3.5) are a special case. 


3.5 Appendix: Lie Algebras from Lie Groups 


As I noted earlier in this chapter, a Lie group, when “Taylor-series expanded,” 
becomes a Lie Algebra. Here I will outline in more detail how this works. 
Closure of the Lie group implies 


exp (16, T") exp (iT , T^) = exp (/Z.(O, T) T*) (3.20) 


where E, (0, Y) must be an analytic function of (O, Y). Clearly =,.(0,0) = O, 
and E,(0, T) = Y.. In fact, =, can only depend on odd powers of O and Y; 
otherwise E,(O, —O) won't equal zero, which it must equal when Y, = —0,. 
Hence 


E.(0, T) = O. + T. — sito aly + O(OT? or e?) +--- (3.21) 
where the f*? are coefficients in the series expansion. 


Expanding the exponentials on both sides of equation (3.20) 


QUA 
14 i(0,- Ya) T^ + B (0,0, + T, T) T*T* + (i?O, T, T T? + 


; 212 
=1+1(0:+ T)T'— jf e, T, T* 4 e (Oa + Ta) (s + Yo) TIT? + 
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gives upon cancellation 


i 
-Oa Tp TIT" = -50a Te (T"T* + T^T*) — AOL To T* (3.22) 


: 
= -594T, (T"T* — T^T*) = -5f' Te, TV T* (3.23) 


This must be true for any O and Y, so 
Te i) cea agra i (3.24) 


which is (3.3). 


3.6 Questions 
1. Consider an equilateral triangle, with the following set of symmetry 


operations 
I identity 
Ry Positive rotation of around center 
Negative rotation of 2r around center 


R_ 

A Reflection through a line joining vertex A to the midpoint of line BC 
B Reflection through a line joining vertex B to the midpoint of line CA 
C Reflection through a line joining vertex C to the midpoint of line AB 


M 


8) 6 


FIGURE 3.1 
Diagram for question #1. 
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(a) Show that these six elements form a group. 


(b) Construct the multiplication table of this group. 


. Find a non-trivial one-dimensional representation of the triangle group 


in question #1. 


. (a) Find multiplication tables for the groups G2 = {I,a} and G3 = 


{I, a,b} respectively consisting of two and three elements each. 
(b) Are the multiplication tables unique? Why or why not? 


(c) Find a multiplication table for the group G3 = {I, a, b,c} consisting 
of four distinct elements. Is it unique? 


(d) Find the multiplication table for the group A consisting of symmetry 
transformations on the equilateral triangle. 


. An orthogonal matrix is defined by the relation RTR = 1, where R is 


an N x N matrix, where T refers to the transpose, (R7); = ds 
(a) Show that the set of all orthogonal N x N matrices forms a group. 


(b) Show that the set of all orthogonal N x N matrices of determinant 
1 forms a group. 


. A unitary matrix is defined by the relation U'U = 1, where U is an 


N x N matrix, and the f operation means take the Hermitian conjugate 
(the complex-conjugate of the matrix transpose), i.e. (U e =U". 
(a) Show that the set of all unitary N x N matrices forms a group. 


(b) Show that the set of all unitary N x N matrices of determinant 1 
forms a group. 


. A symplectic matrix is defined by the relation ST &S = &, where S is an 


2N x 2N matrix and where & is a matrix of the form 


"mS 0 Iw 
~ \-In O 
and where Iy is an N x N identity matrix. Show that the set of all 
symplectic N x N matrices forms a group. 


. Consider a set of three objects (i, j, k} with the following properties 


=k jk=i ki-j 


and where i? = j? = k? = —1. Show that the set {1,i,7,k} forms a 
group under multiplication using these rules. 
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FIGURE 3.2 
Diagram for question #8 


8. 


10. 


(a) Find the general form of a Lorentz transformation for an arbitrary 
velocity Y as shown in 3.2. 


(b) From your answer in part (a), work out the general form for velocity 
addition for two velocities Y and 4. 


(c) Under the velocity addition formula in part (b), does the set of 
velocities form a group? Why or why not? 


For matrices A and B, show 


exp[-A]Bexp A = exp(-adA)B where adAB - [A, B] 


(a) Show that the Jacobi identity implies 
forc figo + prep = 0 


(b) Show that the structure constants of a Lie group generate a represen- 
tation of the group, i.e. that (T«)* = if"? generates a representation. 
'This is called the adjoint representation of the group. 
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DOI: 10.1201/9781420083002-4 


In the previous chapter I mentioned that symmetries necessarily result in con- 
servation laws. In fact, it turns out that a conservation law exists for every 
symmetry. This is very useful, because conservation laws are something that 
can be checked experimentally. Even better, from an experimental perspec- 
tive we can turn the reasoning around, and inductively conclude that if a 
conservation law is present, then the underlying theory should have a related 
symmetry. 

So what is the connection between symmetries and conservation laws? Re- 
call that à symmetry is an operation on a system that leaves some (or all) of 
its properties unchanged. Rotations of a sphere are an excellent example, in 
this case, of a symmetry of shape. A crystal forms another example, in this 
case, a symmetry of structure, in which the atoms forming the crystal can be 
displaced in a certain way that replicates the crystal structure. 


In particle physics, fundamental symmetries are revealed not so much in 
terms of shape or physical structure, or even in terms of the motions of par- 
ticular objects or systems. Rather they are revealed in terms of the set of 
all possible motions a system can have. In other words, symmetries are in 
general manifest in the EQUATIONS OF MOTION of a system, rather than 
in particular solutions to these equations*. To see how this works let's recall 
the basic formulation of the equations of motion from Lagrangian dynamics. 


4.1 The Action Principle 


To describe the motion of a body in classical physics, we assign it a set of 
coordinates q(t), that in general are functions of time, since the body will be 


*Of course it is possible for a particular solution of a given theory to have a high degree 
of symmetry. But when we talk about a symmetry of nature, we are referring to general 
properties of the equations and not to particular solutions. 
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moving around!. To deduce its equations of motion we require that the action 


S[q] = n d] where 17 {gq1,42,45, vds (4.1) 


— QU 
qi = dt 


be stationary with respect to variations of the trajectory. Conceptually, this 
means that we modify the actual (and at this point unknown) path of motion 
that the particle takes and change it in some small but arbitrary manner (see 
fig. 4.1). This is done point-by-point along the trajectory, so we replace in the 
action q(t) with q(t)+dq(t), where óq(t) is the arbitrary small change. By 
demanding that the action be stationary with respect to small changes of the 
trajectory, we in fact require that the action is minimized (or more generally, 
extremized) with respect to these small changes. Hence we require 


0 = 64,5 = Sla + dq] — S[q] 


e OD OL | 


tp OL d (aL OL 
= 2 LREICIILEE 


I 


tr 


(4.2) 


tr 


The “+---” in eq. (4.2) means that I neglect any terms proportional to 
(6q;)°. The third line follows from the second via an integration by parts, 
where dq, = 82€ = 4i. 

'To proceed further we need to incorporate additional information: namely 
we assume óq; = 0 at the endpoints of all the trajectories. The reason for 
this is that we want to find out what equations of motion take our body 
from a given set of initial conditions to another given set of final conditions. 
The only way this comparison will be meaningful is if we require that all 
the trajectories in our variation have the same initial (and final) conditions, 
i.e., the same endpoints. This means that dq; = 0 at these endpoints. This 
eliminates the last term in the preceding expression. Since óq; is otherwise 
arbitrary, eq. (4.2) can only be satisfied if 


OR SHE OR um. isi ti (4.3) 
=—-asals-]= er-Lagrange equations f 
Og dt \ Od; T Ai 

where I have written the general name these equations get due to their special 
significance. We can alternatively write them as 


What we are doing here is considering the body to be a point mass. However, if we were 
interested in the detailed structure of the body we could assign each of its constituents a 
coordinate, idealizing each constituent as a point mass. 
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FIGURE 4.1 
A sample trajectory (solid), and a variation of this trajectory (dot-dash). 


where 


Pi Z i; 
a quantity known as the canonical momentum pij. 

So what I have shown is that solutions to the Euler-Lagrange equations 
are trajectories q(t) that extremize the action. For a given set of initial 
conditions, any solution q(t) to the equations (4.3) is regarded as a possible 
motion of the system. Conversely, functions q(t) that do not obey equations 
(4.3) will not yield an extremal action. 

The variational principle is a very powerful principle — it implies that equa- 
tions of motion governing a system are determined by extremization of a 
scalar functional (the action) [29]. It generalizes to quantum mechanics, gen- 
eral relativity and quantum field theory. Pretty much all of the equations of 
fundamental physics are founded on an action principle. The scalar character 
of the action makes it ideal for introducing symmetries. 

When we consider a symmetry of the system, we are contemplating a group 
of transformations G = {g;} such that 


q. = gia] where invariance —  S[q/] = S{g[q]] = S[q] (4.5) 


Suppose we choose to describe the motion in terms of the transformed vari- 
ables q'(t^). What equations of motion will they obey? We can find them by 
computing the variation 


óq; óq; óq; 
=> " = — E = —— rd = c 7 d d 
0 = ôq; Sla] a (sla]) TT (Slg[q]]) us a, Sla] 
> 84 S[q] =0 (4.6) 


where the first equality holds provided q(t) solves (4.3) and the last holds 


ôq; 
because res z 0. 
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Hence if q(t) solves (4.3), then the transformed quantities q'(t') will also be 
a solution to a set of equations of motion. These will not in general be identical 
to the set that the q(t) obey, but they will be obtainable from this set using 
the same transformations: an invariant action yields covariant equations of 
motion! 

In other words both q;(t) and q;(t') = glq(t)] extremize the action if 
Slg|q]] = S[q], provided G is a group. This last requirement ensures that 


det 5 40 (all group elements must have an inverse, ensuring = Æ 0); were 
it not to hold then we could not ensure that ôg; 5[q'] = 0, but only that some 


linear combination of variations of S[q'] would vanish. 


nmm—— — Án | 

4.2 Noether’s Theorem 

An even more general principle holds whenever we have a symmetry, namely, 
For every symmetry of the action there is a conservation law 


which is called Noether's theorem, obtained by Emmy Noether in 1917. 

To see why this is true, notice that for a fully arbitrary variation (one for 
which neither the equations of motion (4.3) hold nor for which the endpoints 
are fixed) we have 


dd tr [0L d (aL DI e 
“i [ 0.4 [ E dt tal HEX c " e 
SN ab oro d 
ôg L = ES P. (5:)] Óqi 4 ET [p;óq;] (4.8) 


Now let's consider a (small) symmetry transformation, parametrized by a set 
of parameters a = (01,02,03,..., Qm} so that 


qi > d; = qila] = gila, q] 


> da; Qi =q; Pa 


da +++: 


and where we define a = 0 to be the identity transformation (qj[a = 0] = 
g9i[0, q] = ai). 

Let's see what happens if set ôjqi = ôa; qi, neglecting terms of order (Sa)? 
and higher. First we see that equation (4.8) becomes 


d| Od] ƏL [oL d (AL) aq (4.9) 
dt Pi Jo; ~ ðoj 09; dt 0d; da; ` 
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to leading order. Next consider the action: invariance of the action means 
Slq'] = Slq+6a,q| = Sla], or alternatively ôa, S = 0. This can only be true if 


0 = ĝa S = e dt (ba, L] = e dj 2h 
tr tr s 


'The most general condition under which this holds is 


OL d ðq’ 
= — 4.11 
a=0 dt fe (a ðaj ) | a=0 ( 


da; 
provided the function G is some function of q and 9, q that vanishes at the 
endpoints. This is the most general form ða, L can take that vanishes upon 
integration. The function G is some function that must be calculated case-by- 
case for each given symmetry. 

The important thing to note here is that this equality follows because we 
have a symmetry transformation. For a symmetry transformation, da, L must 
be a total time derivative, whereas for an arbitrary variation it is not in general 
a total time derivative (unless the equations of motion are satisfied), as is clear 
from eq. (4.8) above. We can form an object 


-0-e (4.10) 


a 


ðq; dq! 
Pe OU, 2a 4.12 
3 (» Oat M (s aa) E 19) 
and easily show using (4.9) that 
d, (d Od; d 
at s li Boe ^| - 70) is 
B d (OLN] 3q; de 
i = ín dt \0q,/| 00% dt diis 
Y (zz - 3 dt o) 
=0 (4.13) 


which means that Je is conserved: it stays the same at all times, provided the 
equations of motion are satisfied. Note that we impose the Euler-Lagrange 
equations to get the third line; the last line follows from (4.11). 

The current Je is called the Noether current associated with the symmetry 
a. Equation (4.13) is Noether’s theorem: the symmetry parametrized by 
the a’s ensures that Je does not change with time. In other words J, is 
conserved. Noether's theorem is very far-reaching — I’ve only proved it for 
classical mechanics, but it can be shown to hold in both general relativity [30] 
and quantum field theory [31]. It is a very general result that holds for any 
theory of physics based on action principle. 

The meaning of Jẹ depends on what the symmetry is, or in other words, 
what the a’s are. Let's look at some examples. 
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4.3 Spacetime Symmetries and Their Noether Currents 


The foundational symmetries of particle physics (indeed all of physics) are 
spacetime symmetries: symmetries that transform the coordinates of space 
and time. Let’s find the associated Noether currents for these. 


4.3.1 Spatial Translations 


We expect the fundamental laws of nature to be insensitive to a special lo- 
cation in space. This means that there is no special choice for the origin of 
coordinates. Under a spatial translation, the choice of origin is shifted, so 
this symmetry requirement means that the action should be translationally 
invariant. A translationally invariant action will have 


L|q- o,d|- L|gjd where q/=q+0 and q/ = q (4.14) 


where there are three symmetry parameters (01,02, 03) that parametrize the 
translation distances. Equation (4.14) implies 2 L a=o = 0 and so we can set 


G — 0 (or a constant vector; it doesn't matter). We also have 


Od, 
‘= 06; 4.15 
Oar i ( ) 
and so the Noether current J} = is 
RE Od; ðq’ 
Jere p ti Clg = p; (Sin) -0— 4.16 
k Pio (a Ta a (dix) Pk (4.16) 
This means that? E ja 
J= go P Ig (4.17) 


dt dt 
which is the conservation of linear momentum! Hence 


space translation invariance = conservation of linear momentum 


4.3.2 Rotations 

What happens if we insist that our system be rotationally invariant? Rota- 
tional symmetries are described by the orthogonal group SO(3). In Chapter 
3 we saw that these take the form 


ü = (exp [ia - A). qj 2 (/ — ear) qj = qi — (à x q); (4.18) 


FNote that if G were set to be a constant vector, then this would have the effect of shifting 
the definition of conserved momentum by some constant (and irrelevant) value; momentum 
would still be conserved. This is why we choose G = 0. 
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where the (small) symmetry parameters (01, 02,03] are the angles of rotation 
around each orthogonal coordinate axis, the J's are the rotation generators 
we looked at before, and (91,45, 93] are the 3 spatial coordinates. Hence 


Od; jk kj 
an e CER ESC té" dj = €ikjdj (4.19) 


Since the magnitude of a vector is invariant under rotations, a rotationally 
invariant Lagrangian will only depend on the magnitude of q, (ie. L = 
L (|d])), so we will have L (|g|) = L(|g]) (ie. 25|. , = 0) and we can once 
again choose G = 0. 

The Noether current J;> is thus 


3 Od; E E 
JP = pi Ja, P (cixjdj) = —exigpidj = — (P X d = (qx p), (4.20) 
and we obtain , r 
— J = 0> — (q = 4.21 
dr 0 di (qx p) =0 (4.21) 


which is the conservation of angular momentum! Hence 


rotational invariance — conservation of angular momentum 


4.3.3 Time Translations 


We expect that the same laws of physics were operative yesterday as they 
are today, so it is reasonable to require invariance of the action under time 
translations. Under a small time translation t = t + da, 


ðq; 
ða 


dad = q(t + da) — q(t) > = di (4.22) 


a=0 


There is now just one symmetry parameter {a}, which parametrizes how much 
the origin of time has shifted. We will also have a non-zero G because 


dL OL dL 
dt Oo? 0 dt 


(4.23) 


a= 


which tells us that G = L — Lo, where Ly is an irrelevant constant that we 
can set to zero. Hence the Noether current J***"* is 


Od, 
JA = A L =piqg,-L =H 4.24 
(0 2 ) IN pid (4.24) 
qdJjt trans dH 
dt dt ( ) 


which is the conservation of energy, since H = pij; — L is the Hamiltonian. 
Hence 
time translation invariance = conservation of energy 
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Time-translation will be a symmetry of any system whose Lagrangian does 
not explicitly depend on the time coordinate. After all, if the Lagrangian 
did depend on the time coordinate, then the action would not be invariant 
under a time translation. This is a feature of all closed systems: that there is 
no possibility of the system being externally modified at some later time(s). 
Consequently conservation of energy will hold for any closed system. 


4.4 Symmetries and Quantum Mechanics 


The preceding discussion was valid for systems of non-relativistic classical 
particles. The transition to quantum mechanics changes the situation in two 
ways’. First, it necessitates a reformulation of the symmetry-principle/con- 
servation-law relationship. Second, it allows for a broader class of symmetries 
called internal symmetries: symmetries that rearrange the charges and inter- 
nal structure of wavefunctions without transforming space and time. 

A physical system (or particle) in non-relativistic quantum mechanics is 
represented by a wavefunction Y that obeys Schroedinger’s equation: 


ih— =Hv(t) (4.26) 


where H is a quantum operator that is constructed from the classical Hamil- 
tonian H = p;q; — L. Already we see a difference from the classical discussion 
in that H (and not the Lagrangian) is now playing a fundamental role. For a 
stationary state U,(t) = exp (—iEt/h) y (q), the eigenvalues of H correspond 
to the allowed energies (the allowed values of E) that the system can have. 

In passing from classical to quantum mechanics, we replace q(t) —> W(t, q), 
so we expect that a symmetry operation is described as 


q; = gila, al — Y(t; q’) = Ula) (t; q) (4.27) 
where U(o) is a matrix that acts on the (possibly multicomponent) wavefunc- 


tion V. Since U(a) represents a symmetry of the system, the normalization 
of the wavefunction should not change and so 


farma q)-1 


E f dat’ (t; q)U' (o)U(a) Y(t; q) = J dat’ (t; q)u (tq) — 1 (438) 


8The transition to relativity, while preserving the basic conservation laws in section 4.3, 
introduces a change of another kind that we will examine in Chapter 11. 
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which means for any symmetry operation represented U(o) we must have 
U'U =I (4.29) 


so that the normalization is preserved. Hence a symmetry in quantum me- 
chanics must be represented by a unitary transformation on the wavefunction: 
U(o) must be a unitary matrix. 

How are conserved quantities associated with symmetries in quantum me- 
chanics? Based on the classical discussion we expect that a symmetry oper- 
ator U(a) corresponds to a conserved quantity w that is observable. Specifi- 
cally, consider 


ge / dq V (t; qJU(a)W(t: q) (4.30) 


and compute its time derivative 


dw out ow 2 
zz 1 = T T 
¿= fda ( * Uv + YU r) =; faa wno V'UHV) 


I 


; [ v [H, U] V (4.31) 


where equation (4.26) was used. We see that the quantum number w will 
be conserved (i.e.% = 0) provided [H, U] = 0. So the quantum version of 
Noether's theorem is 


Conserved quantum number = [U, H] = 0 © U is a symmetry of the system 


The set of matrices {U(a;)} that form a group is called the symmetry group 
of the Hamiltonian. 

Note that the matrix U need not be connected with any spacetime transfor- 
mation (e.g. rotations, translations) — it might simply represent a symmetry 
that mixes up the wavefunction components. An example of this would be a 
symmetry that rearranges the color charges of quarks. We will see later that 
such internal symmetries underlie all the non-gravitational forces of nature. 

Let's look again at our spacetime symmetries in this context. 


4.4.1 Spatial Translations 


We expect that since the classical Noether current is p, the symmetry operator 
associated with space translations is (—iV) once we make the identification 
P — (—ihV) from standard quantum mechanics. This is indeed correct. 
Invariance under a translation means that 


q/ —q-a > VW(q') = V(q) > V'(q) = V(q— o) (4.32) 


and expanding this for small a yields 


V (a) = Wa 04) = Va) - (à- V) va) 
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and so the symmetry operator is P = -ihV (where Planck's constant h 
ensures that the units are correct on both sides of this relation). We thus 


must have 
E P.H| =0 (4.34) 


for a translationally invariant system. This is conservation of linear momen- 
tum in the direction a. 

Since @ is arbitrary, we can choose it to point in any one of the three 
orthogonal directions, in which case we have 


[P;,H] = 0 (4.35) 


for any given direction. Since the various components of P commute with 
each other (because V;V; = V;V;), in a translationally invariant quantum- 
mechanical system components of the linear momentum can be defined along 
all of the measurement axes. 


4.4.2 Rotations 


Under a rotation of angle œ around an axis ù (@ = offi), a one-component 
wavefunction V transforms as 


MEN j s 
q = (exp ia ; j|, gj; =R q; — W(q) = V(q) > V'(q) = V(R^'q) 
(4.36) 
and expanding this for small @ gives 


V (gi) = Vlg + (Ex Di) = Va) + (x 7) - V) Wa) 


= (reis (7x (-i¥))) wa) (437) 


and so the symmetry operator associated with rotations is L = x (—ihV). 
Note that this is equivalent to the Noether current q x p once we make the 
identification that p — (—ihV) = P from standard quantum mechanics as 
before. Hence a rotationally invariant quantum system must obey the relation 


E L.H| =0 (4.38) 


which we recognize from 3rd year quantum mechanics as conservation of an- 
gular momentum L around the axis à. 

Unlike the situation for translations, the different components of L do not 
commute with each other. This leads to a subtle but important distinction 
from the classical case. In the classical case, we saw that d x p was con- 
served, and so angular momentum around any axis is classically conserved 
in a rotationally invariant system. Since in quantum mechanics the different 
components of L do not commute with each other (even though any specific 
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component commutes with the Hamiltonian H), in a rotationally invariant 
system the angular momentum can only be defined about a chosen axis. The 
choice of this axis is arbitrary (reflecting the classical conservation about any 
axis) but — once chosen — the remaining components are rendered inaccessible 
to measurement. 

Let’s look at the implications of this non-commutativity in a bit more detail. 
For any function f(x,y,z) 


[Ls Ly] f(2,y, 2) = [c -in¥)) (gx (int) 


TP 0 0 Of Of 

| ðr Oz) V Oz Oy 

o? f of Pf Ef 
— fe - ae TM 
zi E (is xác) ee (5 72) 


+2 ( PF PF 


OyOu = OxOy 
Pf 0% DT Of 
PAS OyOz Ozdy Ox "By 
ð ð 
— k2 Sag 
= (27 195)! 
= ihib,f (4.39) 


We can repeat this, cycling through (x,y,z) and arrive at the relation 
[L*, L^] = ire" trt (4.40) 


which is the same kind of relationship between the L's that we had for the 
J's in Chapter 3 (see eq. (3.13)). We will make use of this in subsequent 
chapters. 


4.4.3 Time Translations 


This is a special case in quantum mechanics, since time plays the role of an 
ordering parameter. Under a time translation t = t + q, 
(yl 1 QV 1 
V'(t) = V(t) > V(t) = V(t— a) 2 V(t)—a a 14 zeH v (4.41) 
which means that the Hamiltonian H is the symmetry operator. This triv- 
ially commutes with itself, and so energy is always conserved in a quantum 
mechanical theory. 


74 An Introduction to Particle Physics and the Standard Model 


4.5 Summary 


Under a symmetry glo]: 


INVARIANCE 
of the ACTION = conservation of Jp = (v; ae — G) 
are=0 
COMMUTATION 
[U(a), H] = 0 — conservation of w = f dq V! (t; qQ9JUW(t; q) 
Specifically, for spacetime symmetries: 
Rotational Invariance — conservation of angular momentum 


Space-translation Invariance => conservation of linear momentum 
Time-translation Invariance = conservation of energy 


4.6 Questions 
1. Suppose an operator F has real expectation values, i.e., (4 | F'| 4) is real 
for any wavefunction 4. Show that F is Hermitian, i.e., that F! = F. 


2. Consider the following action in classical mechanics 


s= fe 205] - vn 


(a) Suppose we want to make a transformation that rescales the coordi- 
nates by a constant factor of c, i.e. Z’ = o Y. How must the time rescale 
in order that the action remains invariant if the potential V(z) = 0? 


dí 
dt 


(b) Under what circumstances is the action invariant under this trans- 
formation if V(x) 4 0? Find the general form of the potential. 


(c) Find the Noether current associated with this transformation and 
show that it is conserved when the equations of motion are satisfied. 
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3. Consider a system of N particles, whose wavefunction is V (q1, q2,---, qn). 
If this system is invariant under translations, find the associated Noether 
current. 


4. Consider an algebra consisting of the set of operators {P,, Py, Pz, Lz, Ly, Lz} 
with the combining operator being the commutator. Does this algebra 
close? 


5. Show that the operators P . P and L- L commute with all elements of 
the algebra in question #4. 


6. Consider the operator U — exp (ja: P) where P = —ihV is the 


momentum operator and a is a vector displacement from the origin. 
How does U act on a wavefunction Y (z, t)? 


7. Consider the operator U = exp (-ià : E) where L = 7 x (—¿RV) is the 
angular momentum operator and fi is a unit vector. How does U act on 
a wavefunction V (z,t)? 
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One of the uses of symmetries in particle physics is to classify the possible 
types of particles. Intuitively we conceive of a particle as a tiny, possibly 
indivisible bit of matter. However, we know from quantum mechanics that 
it is described by a wavefunction, and is in a certain sense delocalized over a 
region of space at any given instant. 

But what kind of wavefunction should we use for a given particle? Clearly 
we can’t use the same kind of wavefunction for each particle because they 
have very distinct properties. Electrons have small mass and negative charge, 
quarks have color and large mass, pions can be charged or neutral and have 
no spin, etc. A single type of wavefunction could not properly describe these 
distinct properties. 

This is where symmetries come in. Symmetries provide a framework that 
constrains the types of wavefunctions we can use to describe particles [32]. 
Intuitively, the wavefunctions that we use should covariantly transform with 
respect to the fundamental symmetries of nature that we believe (on empirical 
grounds) to be valid. From this perspective we then ask given a symmetry, 
what particle wavefunctions can logically exist? And what are their charac- 
teristics? In this chapter we will consider these issues. 


5.1 General Considerations 


To answer these questions, the first thing we need to specify is the system. If 
we want to classify a particle, the system under consideration should be just 
the particle and nothing else. This means that the action for the system is 
just the kinetic energy of the particle — no other interactions or potentials are 
present! 

Let’s work non-relativistically to start with. In this case we have 


1 lazl? 
Classically : s= [fs jm ^ | = R fez 12? (5.1) 
Y 2 
Quantum Mechanically : us = Y we) (5.2) 
ot 2m 
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for a free particle of mass m. 

This system is clearly invariant under all spacetime symmetries: space 
translations, time translations, and rotations. Hence we can classify a free 
particle by its properties under these symmetries. Note that the particle 
might be composite (e.g. the pion, the proton, the Kaon, etc.) or elemen- 
tary* (e.g. electron, up quark, photon, etc.). In either case, as long as the 
particle is represented non-relativistically by (5.1) or (5.2), our classification 
will be valid. 

Consider first translations. Invariance under spatial translations means 
that the momentum of a free particle is always conserved, and invariance 
under time translations means that its energy is conserved (both classically 
and quantum mechanically). In the rest frame of the particle the momentum 
vanishes. Hence every free particle is characterized by its energy in the rest 
frame — which is its mass!. We say that the mass of the particle (by which 
we mean the inertial mass) is a good quantum number of the system. 

Rotations are somewhat less trivial. We’ve already seen that a rotation 
is a transformation R for which RR = I, i.e., the group is the group of 
orthogonal transformations of 3 x 3 matrices, which we call SO(3). Such 
transformations preserve angles between vectors and their lengths. We saw 
earlier that they can be written in the form 
R =exp [id J| > RI = (exp lið. 3|) : (5.3) 


a 


where the J operators obey 
[15,3 adn (5.4) 
and also (as a consequence of (5.4)) 
[[J*, 5°] ,3*] + [25,37] ,3*] + [[35,3*],3*] =0 (5.5) 


with a = 1,2,3. We also saw in the previous chapter that L =z x (—ihV) 
TUE 
satisfies the relations (5.4) — that is, setting J" = L* = hi (z x (-i¥)) for 


*By definition elementary particles are indivisible — they are not made of smaller compo- 
nents. So how do we know if a particle is elementary or not? The answer is that we don’t! 
What we can do theoretically is to assume a given particle is elementary and work out the 
consquences. To the extent that experiment is in agreement with theory, this assumption 
is valid. For example all experimental evidence to date indicates that the electron behaves 
like an elementary particle for distances no smaller than 10720 cm [33]. Future experi- 
ments that can probe even shorter distances might uncover evidence that the electron is 
not elementary — if so, then our theory of the electron would have to be modified [34]. 
Since we are working non-relativistically, we actually can’t conclude that the energy in the 
rest frame is the mass. However, this result would follow were we to work relativistically 
(which we’ll do later on). Note that even non-relativistically we must specify the parameter 
m in order to write down the action or Hamiltonian — since m commutes with H, it can be 
used to characterize (i.e. classify) the particle. 
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a = 1,2,3 will satisfy (5.4), with an appropriate factor of A inserted. It is 
easy to check that (5.5) is also satisfied — indeed, it must be if (5.4) holds. 

The nice thing is that any set of N x N matrices J? that satisfy the relation 
(5.4) can be considered as symmetry operators of SO(3) that act on some 
wavefunction V. Note that this turns the problem around. We originally 
began with the matrices 


000 00-1 0 10 
{T° T”, Ty =< —ifļ0 0-1.) ,-7] 00 0 | ,—2 | -100 (5.6) 
0-10 10 0 0 00 


and showed that they satisfied (5.4) by setting J^ = T* Now we want to 
consider (5.4) as the defining relation for rotational symmetry and find all 
possible matrices — of any dimensionality — that satisfy this defining relation. 

Recalling the definition from group theory, any such set will be a represen- 
tation of the rotation group. We only want the irreducible representations 
(the irreps). Each irrep will correspond to a possible way that a free quantum 
particle can manifest rotational symmetry. Hence the irreps of the rotation 
group classify free particles. Once we know these irreps, we know all the 
possible physically distinct particle wavefunctions! 

It’s a general problem in group theory to find these irreps, and I won’t do 
that here [35]. Instead, setting h = 1, PI just write down the most general 
solution to (5.4): 


EES a 
a9) = (zx (1D) & «(89 
= L'àff + (S°) * (5.7) 
or more succinctly, J=L+S. Non-relativistically this operator will act on 
an N-component wavefunction V (2,t) = (Wi (@,t), Vo (Z,t),---, Un (z,t)), 
where the K-th component is 


ie. the wavefunction is the product? of a spatial part y(7,t) and an N- 
component spin part x. The orbital angular momentum operator L acts only 
on p and the spin angular momentum operator S acts only on x , i.e. 


Jv= (Ly) x+y (Sx) (5.9) 
Note that rotational invariance implies via Noether’s theorem only that 


5, H] =0 (5.10) 


tRelativistically this does not hold — although it will hold in the non-relativistic limit and 
in the rest frame of the particle. 
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and that in general 
[LH] 20 [S,H] 20 (5.11) 


In other words L and S will in general not commute with the Hamiltonian, 
though they commute with each other. 

In the rest frame of a particle, there is no orbital angular momentum and 
all eigenvalues of L vanish. However, the eigenvalues of the spin (or intrinsic) 
angular momentum do not vanish. This gives us our second good quantum 
number for free particles: spin. 


5.2 Basic Classification 


So far we've seen that invariances of a free particle under time and space 
translations and rotations imply that its momentum, energy, and angular 
momentum are all well-defined (are “good quantum numbers”), and so can 
be used to classify a particle. In the rest frame of the particle these quantities 
reduce to the particle mass m, and its spin s. 

The mass is specifiable simply by giving a numerical value for m in whatever 
the relevant units are. We know of no principle that determines the particular 
values that m might have for a given elementary particle. The best we can do 
is to determine the value of m from experiment on a case-by-case (or perhaps 
I should say particle-by-particle) basis and input this value into our theories. 

'The spin s, on the other hand, is not so freely specifiable; instead it is 
determined from one of the irreducible solutions to (5.4): 


[8*, S9] = ie" (5.12) 


where J — S since we are in the rest frame of the particle. Any irreducible set 
of 3 matrices (S”, SV, S7} that solves (5.12) determines a possible s value of 
the particle. Hence knowledge of all irreps satisfying (5.12) (i.e. all possible 
S's) is equivalent to knowledge of all allowed particle spins. 

As noted above, finding these irreps is a problem in group theory. Table 
5.1 lists the three simplest (and most commonly used) irreps, along with the 
general form. 

The quantities o* in table 5.1 are 


ioo = {01, 0°, 0°} = { E jj ; (E a) b 237 (5.13) 


and the quantities T? in table 5.1 are 


QE TT) = TESTA T 
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TABLE 5.1 
Irreducible Representations of Spin 
Spin Irrep Terminology Transformation (Rest Frame) 
s=0 $=0 Scalar V=¢ =o 
m M b 
s—i S=30 Spinor Y = w V, = exp [10-4] v» 
a,b=1,2 
> E > PEE E 
s=1 S-T Vector Y=A A= exp [i9 T] ^ A; 
Piscis 
a : a LINGE 
E S Tensor V=xX yh, =exp E : 8 XN 
M 


M,N =1,2,..., (28 +1) 


000 00-—1 0 10 
-4é-i[o01]|,-i[00 o |,-¿| —100 | $65.14) 
0-10 10 0 0 00 


which are the generators of spatial rotations of a vector that we encountered 
N 


in Chapter 3. The general form of (S) is a square matrix of dimension 
(2s +1) x (2s + 1) which obeys (5.12). 

Note the implications of the results listed in table 5.1: the spin of an el- 
ementary particle must be either an integer or a half-integer (in units of f). 
There can never be an elementary particle with s = 9/7 or s = V3 or some 
other real number that is not one of these two types — rotational symmetry 
forbids this possibility?. 

All representations have the following two features in common: 


S^y = (s 5) U=s(s +1) (5.15) 
S^ = sq (5.16) 


where s, is a number that can have any value in the set (—5,—s + 1,—s + 
2,...,8— l, s), for a total of 2s + 1 values in all. 

The s — > representation (or the spinor representation) is of significant 
import in particle physics. It was found by Pauli [36], and the @’s are given 
the name Pauli matrices. You can find out more about their properties in 


8 Actually this statement depends on the number of spatial dimensions. In two spatial 
dimensions spin need not come in units of 7/;/2. An particle whose spin differs from some 
integer times h/2 is called an anyon, as noted in Chapter 1. 
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appendix D. They obey (5.12), i.e. 


1 1 j 
ES d = 50 + cyclic (5.17) 


and we see that the factor of > is crucial to this end. This factor implies 
that under a 27 rotation (about, say, the z-axis) the spinor representation 
transforms as 


= —Yo (5.18) 


So the wavefunction Y — —~ under a 27 rotation! Notice that for the same 
rotation the scalar and vector wavefunctions do not change sign (4 — ¢ 
and Á > A). It is this peculiar feature of half-integer spin particles that 
distinguishes them from integer-spin particles. 


What does relativity do to all of this? We could repeat our analysis by 
replacing R? with A y+ The preceding results do not change — the only kinds 
of wavefunctions allowed are those permitted by the rotation group. Including 
boosts adds nothing new to this. Instead, relativity has a different physical 
implication for the allowed wavefunctions, which we'll look at in Chapter 11. 


'Tables 5.2 and 5.3 summarize our current knowledge of which elementary 
and composite subatomic particles have which spins. Question marks ap- 
pear beside particles hypothesized to exist, but which have not actually been 
observed ll, 


Note that [o, o?| = 2io? & cyclic. The factor of 2 is what destroys agreement with eq. 
(5.12). 

You might have noticed that there are no elementary particles with spins larger than 2, 
and that there appears to be no elementary particle of spin 3/2. There is a theoretical 
obstruction to writing down theories describing pointlike elementary particles spins larger 
than 2 — nobody knows how to get them to interact according to standard approaches in 
quantum field theory. Superstring theories do not have such obstructions — see chapter 25 
for more details. 
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TABLE 5.2 
Classification of Bosonic Particles 
Spin-0 Spin-1 Spin-2 
Elementary Higgs? Photon, Gluon, W, Z Graviton? 
Composite Pseudoscalar Mesons Vector Mesons ?? 
TABLE 5.3 
Classification of Fermionic Particles 
Spin-1/2 Spin-3/2 
Elementary Quarks, Leptons Gravitino? 


Composite Baryon Octet Baryon Decuplet 


E: Se eee 


5.3 Spectroscopic Notation 


It’s common to write the general (non-relativistic) spin-s wavefunction as x, 
where 


X= |ssz) where s; = —s,...,5 (5.19) 
so that 
S? |s s+) = s(s + 1) |s sz) (5.20) 
S? |s s2) = sz |S s2) (5.21) 
S* |ss;) = ys(s +1) — s,(s; +1) |s (s; + 1)) (5.22) 


where the matrices S have been written as 


SE = S7 iS” = Stis? and S7 = S? (5.23) 
S? = (S*)? + (SV)? + (57? (5.24) 


In addition to its mass and spin a particle (e.g. if it is composite) may 
have other good quantum numbers (“good” because they are constants of the 
motion and so their associated operators commute with H). For example, it 
is very common that 


IL L,H] =0 IS us H] =0 (5.25) 


which physically corresponds to assuming that there are no forces or inter- 
actions that can change the magnitude of either the spin or orbital angular 
momentum (although there may be forces or interactions that change their 
directions). So more generally we write 


J|jm) = 5G +1) lim) (5.26) 
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J |j m) = m |j m) m=—j,-j+1,...,9 -1,9 (5.27) 
J? |j m) = Jj +1) 2 m(m x 1) |j (m 1)) (5.28) 


for a particle (composite or not) of total spin j and z-component m, with the 
J's defined analogously to the S"s. When (5.25) holds, a particle is charac- 
terized by the quantum numbers given in table 5.4 in addition to the mass. 


TABLE 5.4 

Particle Quantum Numbers 
Quantity Operator Eigenvalues 
total angular momentum J-J jg +1) 
total spin S.S s(s 4- 1) 
total orbital ang. mom. L.L £(£ 4- 1) 
Axial component of ang. mom. J” Ja =m 


We summarize this information using spectroscopic notation[37] 


2S+1 Ly 


for a state of total angular momentum j = J and spin s = S. The numerical 
values £ of L are often denoted by S,P,D,F (for historical reasons**) instead 
of 0,1,2,3. Note that L can have only integer (and not half-integer) values. 
Hence a particle in the state 3S4 has s = j = 1 and £ = 0; a particle in the 
state °D3 has s = 2, j = 3 and l = 2. 


5.4 Adding Angular Momenta 


When particles collide they produce resonances (short-lived bound states), 
which in turn decay into other particles. These resonances will have spins 
that will be determined by the intrinsic spins of the colliding particles and 
their relative angular momenta. For example, the vector mesons are bound 
states of a quark with an antiquark in a spin-1 combination. 

Consequently a key general question in particle physics (as well as in atomic 
physics) is how, quantum mechanically, do we add two (or more) angular 


**The letters, “S”, “P”, “D”, and “F”, for the first four values of £ respectively stand 
for “Sharp”, “Principal”, “Diffuse”, and “Fundamental”, based on the properties of the 
spectral series observed in alkali metals. 
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momenta? Of course we know the answer classically: we just add the com- 
ponents. Quantum mechanically this doesn't make sense, because we can't 
measure all components of J simultaneously. A measurement of J^ neces- 
sarily alters J¥. The best we can do is to simultaneously measure J?— J.J 
(the magnitude J ) and one component of J, since each component commutes 
with J?. By convention we choose this component to be J*. This is why 
we often label angular momenta of wavefunctions in the form |j j;) or, more 
commonly, as |j m). 

So what can we do to describe how particles of different spin combine? 
The answer (as you might have guessed) is also found in group theory and is 
called the Clebsch- Gordon Decomposition (or CG decomposition!!). Given 
two particles of angular momenta |j; mı) and |j; m2), they can combine in 
the following linear combination of angular momentum states 


jitje 
[ji m1) 9 [jo ma) = 5 C), (ja, jo; Mmi ma) |j m) 
j-|j1—32l 
= Ch s. ll — jal mı + ma) 


MO ema Mi fab Lm + ma) e 
+0492) (i + ja),mi + ma) — (5.29) 


T;-r mo 


where the value of m is always the sum of the incoming ms, i.e. m = m4-- m». 
The left-hand side of eq. (5.29) is the product of two spin wavefunctions 
corresponding to angular momenta jı (with z-component mı) and ¿a (with 
z-component m»). The right-hand side of eq. (5.29) is a linear combination 
of all possible spin-wavefunctions that are permitted by the rules of quantum- 
mechanical angular momentum conservation. 

This rather formidable looking notation is more easily understood by noting 
that the CG decomposition has three essential features. 


1. Since we can always measure (by convention) J^ unambiguously, the 
z-components of angular momenta just add, as noted above, i.e. m = 
m4 + ma. So every wavefunction on the right-hand side of eq. (5.29) 
has z-component Mı + m», as is clear from each term in the sum. 


2. The magnitudes of the angular momenta do not add: the total magni- 
tude depends on the relative orientation of the incoming angular mo- 
menta J; and J2. Since this is empirically unknowable (because we can 
at best measure a component of each J along only one axis), we get all 
possible spin-wavefunctions in the linear combination on the right-hand 
side of eq. (5.29) that are group-theoretically allowed. The biggest 


tt The coefficients are named after two German mathematicians Alfred Clebsch (1833-1872) 
and Paul Gordan (1837-1912), who encountered an equivalent problem in invariant theory. 
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value for J is when Jy and Ja are parallel (giving j = jı + ja) - both 
angular momenta are aligned along the z-axis. The smallest is when J, 
and J» are antiparallel — both are anti-aligned along the z-axis, giving 
j = |j — ja|. All other possible values for J are given by going between 
these values in integer steps 


j =|ji — Gal lia — Jel + 1, lí —j21+2,..., ji +j2—1, jı + Je (5.30) 


3. The CÍ, ’s are numbers that depend on the input parameters ji, j2, M1 
and ma. A book on quantum mechanics will tell you how to calculate 
them [38], and the numbers are listed in tables known as Clebsch-Gordon 
tables. They are included in the appendix to this chapter. Reading these 
tables takes a bit of practice. The total spins being combined are given in 
the upper left of one of the sub-tables. The m-values (or z-components) 
of these spins are given in the lower-left boxes in a subtable, and the 
possible output |j m) wavefunctions are in the upper right boxes in the 
same sub-table. The CJ,'s are the square roots of the numbers in the 
relevant middle boxes, where the minus sign (if there is one) goes outside 
of the square root. 


5.4.1 Examples 
5.4.1.1 Glueballs 


A bound state of two gluons is called a glueball. What possible spins can the 
lowest-energy glueball states have? 


Gluons have spin s = 1, so the possible total spin values range 
between 1 — 1 = 0 and 1 +1 = 2. Since the states are of lowest 
energy, the gluons must have no orbital angular momenta. Hence 
the possible spins are 0,1 and 2. 


5.4.1.2 Positronium 


An electron and a positron can form a bound state called positronium. What 
are the possible spins of a positronium “atom” if the et and e^ have relative 
orbital angular momentum 1? 


'The spin j of positronium will be given by combining the spins of 
e* and e^ and their relative orbital angular momentum. Each 
spin s — 3, so the combined spin (without taking orbital angular 
momentum into account) is either 0 or 1. Since £ = 1 we get 
j —0,1,2 if the combined spin is 1. If the combined spin is 0 we 
get 7 =£+0=1. 
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5.4.1.3 Capture 


An electron is temporarily captured by an Q* particle, forming a resonance. If 
the z-components of their spins are positive and equal, what is the probability 
of observing this resonance to have its maximal angular momentum allowed 
in a state of lowest energy? How does this answer change if the z-components 
of their spins are equal and opposite? 


For this problem we need the CG tables. The Qt has spin 3 and 


the electron has spin i so we need the 3 ® i sub-table. Since 


we are in a state of lowest energy, £ = 0 and so orbital angular 
momentum makes no contribution. Since the z-components of the 
spins are positive and equal, we must have m — i for each particle, 
because this is the only allowed positive z-value for the electron. 


Reading from the 3rd line of the lower-left box in the 3 ® i table, 


we have 5 
31 11 3 
Q p 
53)° 33) 


1 
=~" 121 11 


where the coefficients are found by taking square roots of the num- 
bers in the central box to the right, and the wavefunction compo- 
nents are found from the box above this one, column by column. 
The maximal angular momentum state is therefore |2 1), i.e. j = 2. 


2 
'The probability of observing it is (8) = 75%. If the spins are 


equal and opposite we have, from the 5th line of the lower-left 


boxes in the 3 @ 3 table, 


31 1 1 1 1 
3) 215-3) = e tno 
and so the probability of observation of the |20) state is now 


(E) = 50%. 


5.5 Appendix: Tools for Angular Momenta 
5.5.1 Pauli-Matrices 


The 3-Pauli matrices are 


tao Lo de uu) = { e y , e a) ; E 2 (5.31) 


and we don't distinguish between upper and lower indices, so that c! = 


01,0? = 03,0? = og. We have the product rule 


O;0; = oil + TEGET k = T + Vei Ok (5.32) 
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where the 2 x 2 unit matrix J is often suppressed, as in the 2nd part of the 
expression above. This rule implies 


(c1) = (02) = (o3)? 21 
0102 = ic3 and cyclic (5.33) 


[ci, 05] = 2i€ijs0 [oi oj} = 284; 
and for any two vectors d and b: 
(6-3) (0-5) =a-b+i(ax5)-0 (5.34) 


We also have the exponential relation 


exp [e z| = > G = cos 0 + iĝ- &sin 0 (5.35) 
n=0 i 


where 0 = 00. 


5.5.2 Clebsch-Gordon Tables 


Clebsch-Gordon tables contain explicit formulae for all the CJ, (51, j2; m1 ma) 
coefficients given in eq. (5.29). I have reproduced them on the next two pages 
and in appendix F. 

The total spins (71, j2) being combined are given in the upper left of one 
of the sub-tables. The respective (m4, mz2)-values (or z-components) of these 
spins are given in the lower-left boxes in a subtable, and the possible output 
|j m) wavefunctions are in the upper right boxes in the same sub-table. The 
CÍ ’s are the square roots of the numbers in the relevant middle boxes, where 
the minus sign (if there is one) goes outside of the square root. 
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5.6 Questions 


1. 


(a) A A* with j; = —1/2 collides with an Q^ with j, = +1/2 temporar- 
ily forming a resonance. What is the probability that the resonance is 
spinless? What is the probability that the resonance has spin-2? 


(b) A spin-2 glueball decays into a quark-antiquark pair. What are 
the possible values of the orbital angular momenta of the resultant qq 
system? What is the parity of the glueball? 


. A clever experimentalist figures out how to collide two 2 particles of 


opposite charge into a resonance (i.e., a short-lived bound state) called 
an omegaball. 

(a) What are the possible spins of an omegaball in its lowest-energy 
state? 

(b) The experimentalist then figures out how to make a supply of 10,000 
omegaballs of lowest energy by repeatedly colliding Qt and Q^ together 
with opposite spins along the z-axis. What must these spin components 
be if more highest-spin omegaballs are made than any other kind? Find 
approximately how many of each spin are made in the sample of 10,000. 


(c) How does your answer to (b) change if the highest-spin omegaballs 
are fewest in number in the sample? 


A-particles are produced by a pion beam in the reaction 
T Hp K°+A 


and are observed via their decay A — 7^ +p. You are an experimentalist 
trying to determine the total spin s(A) of the A. The angle of the decay 
products relative to the beam axis is 0. 


(a) If the A is produced exactly along the beam axis what are the possible 
values of s,(A)? 


(b) A trustworthy theorist tells you that s(A) can't be larger than 3. 
Given this constraint, what are the possible angular decay distributions 
for the forward-produced A's as a function of their spin? 


Consider the Hamiltonian 


where B is an external magnetic field, g, yy are constants, and Ho com- 
mutes with the angular momentum operator J. If the magnetic field is 


pointing along the z-axis, find the commutator El z J]. 
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. (a) Find the simplest irreducible representation of the gravitino, a spin- 


3/2 particle hypothesized to exist in superstring theory. 


(b) Suppose two gravitini collide to form a bound state (a gravitiball). 
What possible spins could this bound state have? 


(c) Suppose you had 10,000 gravitiballs of lowest energy, formed by 
repeatedly colliding two gravitini together with opposite spins along 
the z-axis. What must these spin components be if more highest-spin 
gravitiballs are made than any other kind? Find approximately how 
many of each spin are made in the sample of 10,000. 


. Verify the following: 


(a) exp |ë. z| x un (62) = cos 0 + iĝ - sin 0 


(b) 


(c1)? = (02) = (o3)? 21 
0102 = 103 and cyclic 


[ci. 05) = QE; ROK 101,05] = 204; 


. In 1932 the decay of the neutron into a proton and an electron was 


observed. What conservation laws did this decay violate, if any? 


. Suppose an electron is in the state  — 0 5 +8 (5) 


(a) What relationship must o and 8 obey in order for i? to be normal- 
ized? 


(b) What values might be obtained upon measurement of Sy, and what 
is the probability of each? 


(c) What values might be obtained upon measurement of S4, and what 
is the probability of each? 


(d) What values might be obtained upon measurement of S,, and what 
is the probability of each? 
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Discrete Symmetries 


DOI: 10.1201/9781420083002-6 


Some symmetries in nature are not continuous, but discrete: you either per- 
form them or you don't. In other words they can't be generated from in- 
finitesimal transformations. Because of this the conservation laws associated 
with such symmetries are multiplicative instead of additive. In this chapter 
we will examine the notion of discrete symmetries and their implications for 
particle physics. 

In general, a multicomponent wavefunction V will transform under a dis- 
crete symmetry as 


V'i (2) = (Up),  Vpg(x) where g” = (nz) (6.1) 
=> Wi(z)- (Up), UÉ(Ú,*z) (6.2) 


where Up is some matrix (acting on the components of the wave function) 
that is to be determined for each discrete symmetry, and Up is its represen- 
tation when acting on the spacetime coordinates. For a continuous symmetry 
U = I+e(something) where e is small. This kind of Taylor-series expansion 
is not possible for a discrete symmetry. The matrices Up must be fully known 
in order for their action on wavefunctions to be explicitly computed. 

Fortunately this is not difficult to do. I won't deal with all possible discrete 
symmetries here, but instead will concentrate on the three most important 
for particle physics: parity, time-reversal, and charge conjugation. Let's look 
at each. 


6.1 Parity 


Parity is the act of reflecting a system in a mirror. If the mirror-system has 
all the same physical properties as the original, then we say the system is 
invariant under parity. 

Mathematically this kind of reflection involves specifying a plane for the 
mirror and then switching the signs of all the coordinates in the directions 
orthogonal to this plane. An example is shown in fig. 6.1. This is generally 
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quite inconvenient, so usually a parity transformation is implemented by per- 
forming an inversion on the coordinates Y — —z: every point is carried to its 
diametrically opposite location through the origin. This is a combination of a 
reflection with a 180? rotation. Both transformations turn a right hand into 
a left hand and vice versa. Inversions are easier to work with, since we don't 
have to choose a plane for the mirror. We shall generally refer to inversions 
as parity transformations. The difference between the two is illustrated in 
figures 6.1 and 6.2. 


FIGURE 6.1 
Reflection in the x-z plane: (x,y,z) — (a, —y, 2) 


FIGURE 6.2 
Inversion: (x,y,z) — (—2,—y,—2z) 
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So if we call the parity operator P, we have 


P: (t,2,y, 2) > (t,-2,-y,-z) > P (V) Sr (6.3) 
where V is any vector. Note that 
p(V-W)=pP(V)-p(W)=(-V)-(-W)=V-W (64) 
but 
p(VxW) =P(V) xP(W) = (-V) x (-W) =40 xw (6.5) 


which we expect, since P changes right-handed coordinate systems to left- 
handed ones. 

So we see that there are two kinds of vectors: those that reverse sign under 
P and those that do not. We call this 2nd kind of vector a pseudovector, since 
it transforms under parity opposite to the way a vector transforms. Note that 
if V is a vector and A is a pseudovector then 


P (V x A) = (-V) x (+4) = —V x A= a vector (6.6) 
P (V : A) = (=P) : (+4) =-V-A= a pseudoscalar (6.7) 


and so tensor quantities (scalars, vectors, etc.) may or may not be pseudo, 
depending on how they transform under P. The various possibilities are listed 
in table 6.1. 


TABLE 6.1 
Behavior of Scalars and 
Vectors under Parity 


Scalar P(s) = +s 
Pseudoscalar P(p) = —p 
Vector P(V)--V 
Pseudovector P(A) = +A 


If P is applied twice we must get what we had originally, and so 


P =I (6.8) 


or in other words the eigenvalues of P are +1. Scalars and pseudovectors 
have eigenvalue +1, whereas pseudoscalars and vectors have eigenvalue —1. 
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Note also that we can write a representation of P as a matrix when it acts on 
spacetime coordinates: 


1 
P: (52,32) > (572 79 2) = (Ae), = | y (6.9) 
0 


This shows that Ap is also a Lorentz transformation, since gag (Ap)5, (Ap)^, = 
Juv, Something easily shown by explicit calculation for each component. 

So in addition to their mass and spin, wavefunctions of elementary particles 
are also classified according to their parity eigenvalues. Let's look at a simple 
example. 


6.1.1 Parity of the Photon 


What is the parity of the photon? We can figure it out by recalling that 
in the hydrogen atom the wavefunctions (or rather, the spatial part of the 
wavefunctions) have the form 


V (2) = V (r, 0, p) = 9(r) Yi" (0.9) (6.10) 


where the Y" (0, 9) are the spherical harmonics, given in table ??. Now note 
that 


P : (t,£, y, z) = (t, =T, Y, +2) 


—P:(t,r,0,0) > (t,r, 70,7 +0) (6.11) 
> P [Y (2)] = Y (-2) = e(r)Vr (x — 0,7 + 9) 
= (0 «(Y (6,6) = (-1)* v (z) (6.12) 


and so we see that a state with orbital angular momentum / has a parity 
eigenvalue of (T This means that the S,D,G,... states of the Hydrogen 
atom (the £ =even ones) have even parity, whereas the P,F,H,... states 
have odd parity. In a transition where A£ = +1, one photon is absorbed (or 
emitted). Hence (since electromagnetism is parity-conserving) the photon has 
negative parity. 


6.1.2 Parity Conservation 


We also assign positive parity to quarks and leptons (and negative parity to 
their antiparticles). Of course, this is a convention, and we could have chosen 
a reverse assignment; it won't matter as long as parity is conserved". 


*Thinking ahead to when we include relativity, we will find that we need to include antipar- 
ticles. It is possible to show that a fermion has a parity opposite to that of its antiparticle 
whereas a boson has a parity that is the same as that of its antiparticle. This result follows 
from quantum field theory, a subject beyond the scope of this text. 
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To the best of our experimental knowledge, parity is conserved in strong 
and electromagnetic interactions. However, it is not conserved in weak- 
interactions. Weak interactions affect all known particles including a neutral 
fermion of very small mass called a neutrino. All neutrinos are left-handed 
(and all antineutrinos are right-handed): their spins are always antiparallel 
to their momenta’. Figures 6.3 and 6.4 illustrate this concept. Nature is not 
mirror-symmetric — parity is violated! 


direction of 
motion 


rotation 


FIGURE 6.3 

A right-handed particle is one that rotates in the direction of the fingers of 
the right hand while traveling in the direction of the thumb. Similarly, a 
left-handed particle rotates in the opposite direction. 


6.2 Time-Reversal 


Macroscopic physics is generally NOT invariant under time-reversal; for ex- 
ample an explosion does not look the same if a movie of it is run backwards! 
This is what we refer to as the arrow of time. Unlike the spatial coordinates, 
which can be traversed in any direction we like, the time coordinate seems 


+This discovery was first made in the 1950s by Wu [39] and we'll look at it in more de- 
tail when we consider weak interactions in Chapter 19. Recent observations at Super- 
Kamionkande [41] and SNO [42] have revised the original understanding of neutrinos as 
zero-mass particles. There is now excellent evidence that they are very light-mass particles, 
in which case there are also right-handed neutrinos! Our best experimental information so 
far is that the right-handed neutrinos do not experience the weak interactions. 
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right-handed left-handed 


[O &O- 


FIGURE 6.4 

A diagram representing the relationship between momentum, spin, and hand- 
edness. The thick arrow points in the direction of the spin vector. The thin 
straight arrow is the momentum vector. 


to be one for which only a one-way journey (from past to future) is not only 
allowed, but required. Understanding why this should be the case is one of 
the outstanding puzzles of modern physics [43]. 

At a macroscopic level the asymmetry associated with the arrow of time 
is a consequence of initial conditions, which according to the laws of thermo- 
dynamics always become less ordered overall as time increases (though local 
regions can become more ordered as in, say, the construction of a building). 
We say that a physical system in combination with its environment experi- 
ences an increase in entropy. 

At the microscopic level of collisions between the fundamental particles in 
our example of the explosion, the situation is quite different. At this level 
we might expect time reversal invariance to hold because Newton's laws are 
time-reversal invariant. From this perspective the puzzle of the arrow of time 
reduces to the problem of the origin of initial conditions. 

Common quantities transform under time reversal T and parity P are shown 
in table 6.2. 

As with parity, we can represent T as a matrix when it acts on coordinates: 


—1000 
i 0 100 
1:09) (try) (a= | 0199 (613) 
0 001 
and it, too, is a Lorentz transformation: gag (ANS (Ar), = guy. Both 


fermions and their antiparticles transform the same way under time-reversal!; 
the same holds for bosons and their antiparticles. 


6.2.1 Detailed Balance 


Time-reversal is difficult to test — as noted above, since all physical systems 
experience time to move forward, we have no way of directly forcing a system 


*This is another result from quantum field theory. 
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TABLE 6.2 
Transformations of Common Physical Quantites under Time-reversal and Par- 


ity 


Quantity T P Comments 
Position 7 r -r 
Momentum p —p —p Polar vector 
Spin $8 —s s Axial vector 
(like Fx p) 

Electric field E E -E E=-VV 
Magnetic field B =B B like a ring current 
Magnetic 

dipole moment ji -ğü ü 
Electric 

dipole moment d d —d 
Longitudinal 

polarization $8. p S- p —8.p Chirality 
Transverse 


polarization re (pi x p») —s. (pi x p: ) S- (pi x p: ) 


to reverse its trajectory in the time direction (i.e., we just can't “run the 
movie backward"). 

However, we can take a particular physical reaction and run it in reverse. 
Consider for example the two-body scattering of a neutron off of a proton to 
form a deuteron and a photon. Under time-reversal we would have 


n+p >D+y —T— D+y— n+p (6.14) 


T-invariance, if it held, would force the rate for both processes to be the same 
for corresponding conditions of energy, momentum and angular momentum; 
this is called the Principle of Detailed Balance [44]. It is the most direct test 
of time reversal that we have. 

So far all experiments have indicated that time-reversal invariance is a sym- 
metry of the strong and electromagnetic interactions, with the principle of 
detailed balance holding for every known case for these interactions. Un- 
fortunately the places where we expect to see T-violation are in the weak 
interactions: after all, this is where parity is violated (and, as we will see, 
charge-conjugation invariance as well). Here the principle of detailed balance 
is extremely difficult to test. For example the weak decay of the A meson is 
A = p+ 7, so we would expect to test detailed balance via 


A>p+r << T>p+r— A (6.15) 
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'This would in principle form a check, but in practice the pion and the proton 
form many other states besides the A when they collide. This is because 
the strong interaction between the pion and the proton totally dominates the 
feeble weak interaction that would form the A. It is therefore pretty much 
impossible in practice to pick out the time-reversed reaction on the right-hand 
side of eq. (6.15). 

Can we do anything about this? Yes — we can precisely measure static quan- 
tities whose value should be exactly zero if T is a symmetry. Any empirical 
evidence that such quantities were nonzero would therefore be firm evidence 
that T is not a symmetry of nature. 

One such quantity is the electric dipole moment d of an elementary particle. 
If it were nonzero it would have to be either aligned or antialigned with the 
spin of the particle since there is no other direction available. But from table 
6.2 we see that an electric dipole moment does not change sign under time- 
reversal whereas the spin does, so any nonzero-value of d would be a signature 
of time-reversal violation. At present the best limits that we have are for the 
electron [45] and the neutron [46]: 


ldn| «6x10 9e cm — |d.| < 1.6 x 10777e cm (6.16) 


setting stringent upper bounds on T-violation. We'll look more at the situa- 
tion for the neutron in Chapter 25. 


6.3 Charge Conjugation 


Charge conjugation, denoted by C, transforms any state into a state with the 
same energy, momentum, spin and mass but with all other quantum numbers 
(the “charges”) reversed. In other words, C transforms each particle into its 
antiparticle, e.g. 


Clp) — [p) Gee (6.17) 


Obviously C? = +1 (just like T? = +1 and P? = +1), and so it also has 
eigenvalues +1. However,, unlike P, most particles are not eigenstates of C, 
because a particle is not the same state as its antiparticle (for example, a 
positron is not an electron). Any particle that is an eigenstate of C must be 
its own antiparticle, since the eigenvalue equation for C requires 


C |particle) = |antiparticle) = + [particle) (6.18) 


The photon is one such particle, and the 7°, n, 7, p?, ¢, w, and J/w mesons 
are also their own antiparticles. Consequently they can be assigned definite 
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charge-conjugation quantum numbers. By convention, C does not change the 
mass, energy, momentum or spin of a particle?. 


6.3.1 Charge Conjugation of the Pion 


If electromagnetic charge is reversed for all particles in a given system, then 
the sign of the electromagnetic field in the system must also be reversed, and 
so C =—1 for the photon. We can use this to deduce the value of C for the 
pion. For the 7°, experiments have shown that 


n? — 2y (6.19) 


which is an electromagnetic decay, because only photons are in the final state. 
Hence we must have C = +1 for the 7° since C is conserved in electromag- 
netism. 

The conservation of C is therefore something we can check by searching for 
the decay of the 7° into odd numbers of photons. Any positive evidence for 
such a decay would indicate that something was wrong with our understanding 
of electromagnetism and, by extension, the Standard Model. Perhaps our 
theory of electromagnetism would need revision, or perhaps the 7° would have 
a structure that is different from our current understanding. Experimentally 
we have 

T (1% => 37) 


oe 23 dese 1078 2 
Taray <31x10 (6.20) 


and so we see that C is conserved in electromagnetism, to better than one 
part in 10 million [47]. 


6.3.2 Charge Conjugation of Fermions 


A fermion-antifermion bound state of orbital angular momentum / and spin s 
must have C — (p E This can be deduced by noting that the lowest-energy 
state, which has s = £ = 0, can decay into two photons by energy conservation. 
Since C = —1 for the photon, we must have C = (C(photon))? = +1 for this 
bound state. If the bound state has s — 1, then the fermion-antifermion pair 
have the same spins: a spin-flip transition to an s = O state results in the 
emission of one photon, so C = (—1)* if £ = 0. Finally, if £ Z 0, the excited 
bound state can decay electromagnetically with £ — £ — 1 by emitting one 
photon. The only consistent choice that describes these observations is to set 
C = (-1)** for a fermion-antifermion bound state. 


Strictly speaking, the antiparticle of an electron with spin 3, and momentum p'is a positron 
of spin —5, and momentum —p. However, we are interested in how C changes the internal 
quantum numbers of a particle — the electric charge, the quark color, etc — and so we define 
C as an operator that reverses only internal quantum numbers of any given particle. 
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C is not a symmetry of the weak interactions: C applied to a left-handed 
neutrino gives a left-handed anti-neutrino (i.e. a neutrino with opposite “weak 
charge”). But left-handed anti-neutrinos do not couple to any known parti- 
cles", so C is not conserved for weak interactions. 


6.4 Positronium 


An ideal place to strengthen our understanding of discrete symmetries is with 
positronium. Positronium is an electromagnetically bound state of eFe” that 
can decay to photons: 

ete” —> 23,85,... (6.21) 
It is bound together just like the Hydrogen atom, except that a positron is 
responsible for the binding instead of a proton. The wavefunction is 


$ = U(r, 9, 6; u) =(s) = U(r, 9, $; u) (Ve- 8 Yer) (6.22) 


where V is the wavefunction of the Hydrogen atom, but with reduced mass 


Me me 2 


1 
L= ( L44 ) = tme. Its energy levels will be given by the Bohr formula 


Ey = PENE for n = 1,2,3,... where a = e ~ im is a dimensionless 
quantity called the fine- structive constant. 

The wavefunction V provides information about the relative spatial rela- 
tionship between any two particles regardless of their structure. The object 
E(s) = (w+ Q Ye-) is the spin part of the wavefunction, composed of the 
spins of the electron and positron. Since each of these are spin-1/2, they can 
only combine to give a total spin of 0 or 1. Using the Clebsch-Gordon tables 
??, we have 


? 


Rr m Veet triplet S = 1 

=1.0 = L (yt Lu riplet 5 — 

(1,0) — 75 Js (v1. e Vies) ORTHOPOSITRONIUM (6-23) 
=0 1) = vit 

= 1 pu singlet $ — 0 

AS a 75 (Vth uL.) PARAPOSITRONIUM 42% 


Now recall that charge-conjugation changes a spin-up (which I'll write as 
spin-]) e~ into a spin-f et. Hence under C: 


cpup-c[eLeh|-wheL--eLeh--s0,) (6.25) 


“It was once thought that they didn't exist, but experiments at the Sudbury Neutrino 
Observatory [42] strongly imply that neutrinos have mass, which means that both right- 
handed and left-handed neutrinos and antineutrinos exist. We will consider this subject in 
Chapter 25. 
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where the 2nd from last step follows because fermion wavefunctions anticom- 
mute due to the Pauli principle. Hence 


cpORTHO __gORTHO ana CoPARA _ ¿PARA (6.26) 


which you can easily check. Recalling that the photon is negative under C, 
we consequently find that 


gPARA __, oy Lay 4 6y4-° (6.27) 
pORTHO __, 34 4 5y 4 Ty ese (6.28) 


or in other words, PARA can only decay into even numbers of photons, and 
ORTHO only into odd numbers of photons!. A 2-Body decay process is 
"easier" to go to than a 3-Body one - the phase space is larger for decay 
into smaller numbers of objects — so we expect PARA to decay faster than 
ORTHO. Also, the amplitude for emission of one photon is proportional to 
the charge e of the electron. This means that the probability for PARA to 
decay will be proportional to |e?|? ~ a?. Since ORTHO decays by emitting 3 
photons instead of 2, we expect its decay rate to be smaller than PARA's by 
a factor of a. 

Note that positronium decay depends on the electron and positron annihi- 
lating each other, a situation that can only occur if they are in the same place 
at the same time. Hence the decay rate must be proportional to |W (0)]^, i.e. 
the square of the wavefunction at the origin, which is where the electron and 
positron “collide.” From atomic physics we know that 


1 aê 
v (0)? = — = 6.29 
VOP = gs (6.29) 
where a — ar is the Bohr radius of the positronium atom and re = E is 


the classical electron radius. The actual theoretical calculations give [48, 49] 


T (PARA > 27) = 4rher? |W (0)? = HE rie (1.252 x 107195) ' 
= 8.00 (nsec) ! (6.30) 
T (ORTHO > 33) = x (x? — 9) meta? = (1.374 x 1077s) ~} 
= 7.21(usec) ! (6.31) 


for the decay of the ground states. 


The 2-photon and 3-photon decays are the dominant processes; decays into more photons 
are higher-order corrections. 
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6.4.1 A Puzzle with ORTHO 


Until very recently more accurate calculations yielded a puzzle. While the 
agreement between theory and experiment for PARA was always in good 
shape, there existed a discrepancy between theory and experiment for ORTHO 
that went unexplained for decades. Experiments as recently as 1990 differed 
by at least 6 standard deviations from the theoretical calculation. This led 
theorists to propose that all kinds of exotic hypotheses that were sometimes 
rather bizarre extensions of the Standard model. These ideas included axions, 
C-odd bosons, millicharged particles, forbidden numbers of gamma rays, and 
even a mirror universe! 


y -Detector (12.5 cm) A 


Porous 


— Film 


y -Detector (1 of 4) 


FIGURE 6.5 

A schematic of the cavity used in the new precision orthopositronium de- 
cay rate measurement. Positrons are focused through two apertures of an 
aluminum cavity onto an porous silica film. The emitted thermal positron- 
ium decays in vacuum. Reprinted figure with permission from R. S. Vallery, 
P.W. Zitzewitz, and D.W. Gidley, Phys. Rev. Lett. 90 203402 (2003) [50]. 
Copyright (2003) by the American Physical Society. 


However, in May 2003, R.S. Vallery and colleagues published a paper [50] 
describing the results of a more careful experiment on orthopositronium. They 
created orthopositronium by firing a low-energy positron beam into a spe- 
cial micron-thick nanoporous silica film; orthopositronium formed from the 
slowed-down positrons as they captured electrons. Vallery and collaborators 
were able to measure how long this took by detecting the gamma rays after 
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annihilation in a scintillator. This set-up, illustrated in fig. 6.5, overcame 
problems encountered in previous experiments to measure decay rates, which 
sometimes measured energetic positronium annihilating on the cavity walls 
of the detector. In the Vallery experiment, only positrons that annihilated 
their bound electrons were detected. They measured a lifetime for ORTHO 
in agreement with the current QED calculation that differed by only about 
0.01496 from the theoretical value! 

The moral? Sometime a simple explanation — in this case, that something 
was wrong with the experiments — is the right one. 


Theory Experiment 
P(PARA) 7.9852: .010 (nsec) ! [48] 7.994 + .011 (nsec) ! [5 
7.0516 + .0013(usec) | [52] 
7.0514 + .0014 (usec) ~* [53] 
7.0482 + .0016 (usec)  ! [54] 
7.0404 + .0018 (usec) ~* [50] 


T(ORTHO) 7.039979 + .000011 (usec)”* [49] 


6.4.2 Testing Fermion-Antifermion Parity 


Consider next the parity of positronium. For PARA we have 


where the first minus sign is due to the opposite parity of the electron and 
positron, and the (D comes from the parity of the spatial wavefunction. 
Since / — 0 for the ground state, the final state for the 2 emitted photons 
from PARA must have negative parity. 


ND j 
PARA in Ground State Va 
FIGURE 6.6 


Schematic diagram of Positronium decay. 


In the rest frame of PARA the 3-momenta of the photons must be k and —k 
as shown in figure 6.6. The initial state has no angular momentum (J = 0). 
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The final state wavefunction |y1y2) can only depend on the photon momenta 
and polarizations, and must also have J = 0 by angular momentum conser- 
vation. Therefore it must be a scalar function of the momenta and polariza- 
tions. Furthermore, since photons are bosons, we must have |y1 y2) = + |7271)- 
Hence " 

l^ ya) = A61-:65 + B (ê x 69) -k (6.33) 
where A and B are scalar functions of the momentum and polarizations. Now 
by conservation of parity we must have P|y1y2) = — |yiyo). Hence 


Ima2) = B (êi x €2) - k (6.34) 


since P ( ê x 63) E k =- (êi x 69) . k but Pé; i 63 = 61 i êz. The amplitude 
(PARA |y172) is largest for €, L és. 


Positron source 


FIGURE 6.7 
Schematic diagram of test of parity conservation in positronium decay. The 
polarization angle ¢ corresponds to a rotation perpendicular to the page. 


If one emitted photon exhibits an X-polarization, the other always shows 
a Y-polarization, i.e., the planes of their polarizations must be perpendic- 
ular to each other. This may be confirmed experimentally by utilizing the 
feature that Compton-scattering cross sections for polarized photons are sig- 
nificantly greater for scattering into the plane at right angles to the E-vector 
of the incident photon, i.e., 90° to the direction of polarization. The setup 
is schematically shown in figure 6.7. The optical analog of the scattering 
material is the polarizing filter. The Klein-Nishina formula [56] shows the 
scattering cross section c is proportional to: 


k k 
c = — + | — 2sin? 6 cos? à (6.35) 
ko k 
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with ko and k representing the momenta of the incident and the scattered 
photons respectively, 0 the angle of scattering, and ¢ the angle between the 
plane of scattering and the é-vector of the incident photons. 

This experiment was first done by Wu and Shaknov [57] with %Cu as the 
source. At 0 = 90°, annihilation radiation scattering turns out (for the ener- 
gies relevant to the source ^Cu) to be two times stronger for ¢ = 90° (when 
é1 L 63) than for ¢ = 0° (when 6; || 62), thus providing an effective y-ray 
polarization analyzer. They observed 


Rate(¢ = 90°) 


e cries Dade: ; 
Kad 0s] 04 + 0.08 (6.36) 


in agreement with the expected value of 2.00. We can regard this experiment 
as demonstrating that electrons and positrons have opposite parity. 


6.5 The CPT Theorem 


Strong and electromagnetic interactions are observed to separately conserve 
C, P and T and our present theories of these interactions (QCD and QED) 
are constructed so that these symmetries are preserved**. 

Weak interactions, however, violate C and P separately. Parity-violation in 
B-decay was first observed by Wu et al. [39] and has been seen directly in 
nuclear reactions [40] such as 


160 12 
payo  JP=2 


+ -4He T = (1.040.3) x 107105 (6.37) 


Weak interactions also violate C-invariance: no left-handed antineutrinos have 
ever been observed. And finally, Kaon decays violate both C and P. We'll 
look at all of these processes in subsequent chapters. 

There is a theorem by Schwinger, Lüders and Pauli [58] called the CPT- 
theorem which states that: 


ANY Lorentz-invariant Hermitian Lagrangian is invariant under 
CIPT provided 


1. The ground state is invariant. 


2. 'The theory is local (i.e. has no action at a distance). 


**There is actually a subtle exception to this for the strong interactions that we will look 
at in Chapter 25. 
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A given theory might violate any of C, IP or T separately, but it must respect 
the combined operation CPT if these two conditions hold. Experimentally this 
means 


1. Particles and antiparticles must have the same mass 
2. Particles and antiparticles must have the same lifetimes 
3. Particles and antiparticles must have the same magnetic moments 


Any discrepancy in such experiments would signal a breakdown of quan- 
tum mechanics! Hawking and Penrose have each separately suggested that 
quantum gravity forces such a breakdown [59]. For example, the presence of 
black holes might imply that pure states evolve into mixed states (something 
that can't happen in quantum mechanics) because part of the wavefunction 
is “absorbed” by the black hole is therefore irretrievably lost if the black hole 
evaporates in a purely thermal fashion!!. 

Table 6.3 summarizes the results of a few key experiments that test the 
CPT theorem. So far CPT violation has not been observed — yet! 


TABLE 6.3 

Sample Tests of CIPT 
Lifetime Tu* /Ty- = 1.00002 + .00008 
Mass [mgo — mol /Maverage < 10718 


Magnetic Moment pe+/pe- < 107% 


6.6 Questions 


1. Are any of these processes allowed? Why or why not? 
(a) p atta (b 1° — 5y 
(c) p? — 37° (d ne — rt +r 47° 
2. According to the Standard Model, which of the following reactions are 


allowed and which are forbidden? State the reasons why if not. If 
allowed, state what interaction is responsible for the process. 


tt This is because no information is contained in the outgoing thermal radiation. 
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(a) m—ptet+yv (b) y+ Z? — ve +7” 
(c) u* - 77 ytyt+y (d Z9 — D, + 
(e) DO — K- + pt (f) e +0 —t+b 


3. Consider the non-relativistic Schroedinger equation with the Hamilto- 
nian E 
|p 3 
H=—+V 
am + Y (E) 
satisfied by the wavefunction Y (4). Under what circumstances will 
V (—2) satisfy the Schroedinger equation? 


4. The deuteron is a ?S, bound state of a neutron and a proton, and 
must have a wavefunction that is antisymmetric under proton-neutron 
interchange. What is the parity of the deuteron? 


5. Show that Maxwell's equations are invariant under time-reversal. 

6. Suppose the expectation value of J-P were found to be nonzero in 
some process. 
(a) What would this imply about parity conservation? 
(b) What would this imply about time-reversal invariance? 


7. Time reversal interchanges initial and final states, so that if T |) = |x)* 
then 


T ((9 10) = (9 Ix)" = (x 18) = (9 (10) 


where the first equality is due to interchange of initial and final states 
and the second equality is the property of quantum-mechanical ampli- 
tudes. 


(a) Show that the above relation implies Tc = c*T where c is any com- 
plex number. 


(b) Given that TJ = —JT where J is the angular momentum operator, 
show that 


1 1 11 
=-T]-=,-= =T|-,= 
1x1) B 5) and Ix2) 1153) 
form a spin-1/2 in the time-reversed system. 


(c) What is T? for a state |) with an odd number of spin-1/2 particles? 
What is T? for a state |y) with an even number of spin-1/2 particles? 


(d) Suppose the state |ó) in (c) is an eigenstate of the Hamiltonian. 
What is the degree of the degeneracy of this state? (ie. how many 
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distinct states are there at a given energy level?) This degeneracy is 
called Kramer's degeneracy. 


(e) Is Kramer's degeneracy preserved in an electric field? Is it preserved 
in a magnetic field? 


T 
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Experimental information in high-energy physics has historically most “com- 
monly” been obtained from accelerators: machines that accelerate charged 
particles to very high speeds and then let them collide with other particles. 
They are very expensive to build and maintain, and they have a limited use 
in terms of the fundamental physics that they can reveal. Yet they provide us 
with a kind of information about the subatomic world that cannot be obtained 
in any other way. 

Accelerators basically do two things. First, they provide us with informa- 
tion about the detailed structure of subatomic systems. They produce new 
interactions and/or bound states of known particles, whose characteristics 
provide us with further information about the laws of nature and the struc- 
ture of matter. Second, they produce new particles. This is why they cost so 
much money — in order to produce a new particle you need at least as much 
energy as the rest mass of the particle. This might not sound like much at the 
subatomic level, but there is a lost of energy “wasted” in a collision process 
because we can't directly control the products of the collision that emerge nor 
the loss of energy due to other effects. 

Accelerators have played an essential role in particle physics. Without them 
it would simply not be possible to check in any detailed way whether or not 
our theories were correct. They have led to the discovery of particles and 
interactions that nobody anticipated, and have provided us with a picture of 
the subatomic world that no one imagined as recently as a century ago. They 
probe the shortest distances humankind has ever measured by manipulating 
beams of particles traversing millions of kilometers in a few seconds to micron 
precision. They have been compared to the great cathedrals of Europe by 
Robert R. Wilson because of their immense size, intricate complexity, and 
symbolic representation of the human intellect [62]. 

The first accelerators that were constructed in the 1930s had particle beams 
whose energies were a few hundred keV. Seventy years later, in the first decade 
of the 21st century, the Large Hadron Collider will generate particle beams 
with energies of nearly 10!?eV — a factor 100 million times greater! The 
effective energy available to study new physics is even larger, about 10'8eV, 
since the LHC arranges for two beams of similar energy to collide. This will 
allow us to probe distances shorter than 10^ !?cm, yielding the world’s most 
powerful microscope. 

Accelerators have grown out of their core purpose of providing experimen- 
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tal information for particle physics, and today are used in condensed mat- 
ter physics, biomedical technologies, geophysics, electronics, food processing, 
and many other areas. Accelerator science has become a separate intellec- 
tual subdiscipline with both pure and applied aspects, providing yet another 
illustration of how basic research can foster positive economic and social de- 
velopment. 

This chapter will provide a brief overview of accelerator physics. Following 
an historical path, we shall begin with the earliest machines, sketching the 
emergence of the more complex technologies as they developed over the past 
70 years. 


7.1 DOC Voltage Machines 


The simplest way to accelerate charged particles is with a high voltage DC 
source [61]. Such machines today can at best achieve beam energies of 20 
MeV. For nuclear physics experiments this is useful, but for particle physics 
this is too low an energy. A picture is given in figure 7.1. 


lon source 


Vacuum pump 


FIGURE 7.1 
Prototype of the simplest accelerator. 


A particle of charge q is generated by a source of ions. It is then accelerated 
in an electric field E, which means it experiences a force F', and consequently 
gains energy € as it travels a distance d 


F=qE>€ DE q|E| d — av (7.1) 


where V = |E | d is the voltage of the machine. The system must also be 


in as high a vacuum as is manageable — otherwise the accelerating particle 
will lose most of its energy in collisions with air molecules in the accelerator. 
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These elements — particle source, accelerating structure, and vacuum pump — 


appear in every accelerator. 
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FIGURE 7.2 


A Cockcroft-Walton Accelerator. Diagram courtesy of the Contemporary 
Physics Education Project (used with permission). 
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Electric Reld 
Across Gap 


'The earliest such machine of this type is called a Cockroft- Walton accelera- 
tor [63], shown in fig. 7.2. The ion source was hydrogen gas, and the original 
machine developed by John Cockroft and Ernest Walton was able to acceler- 
ate protons to 400 keV. Such machines today can reach a maximum voltage 
of 1 MeV due to voltage breakdown and discharge. These machines are often 
used as the first step of a multistage process that accelerates particles to much 
higher energies in more powerful accelerators. 


A more sophisticated DC machine is a Van de Graff accelerator [64], in 
which a conveyor belt carries positive charge (in the form of ions that are 
sprayed onto the belt) to a collector which in turn transfers the positive charge 
to the dome. The principle at play here is that charge on any conductor 
resides on its outermost surface. If a conductor that is carrying charge touches 
another conductor whose surface surrounds it, then the charge on the first 
conductor will be transferred to the second one. Hence, as Robert van de 
Graff realized, one can “pile up” charge on a conductor to increase voltage by 
continually transferring charge to it via a conveyor belt, as shown in fig. 7.3. 


This technique can yield voltages of up to 12 MeV. Positive charge (obtained 
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lower brush 


FIGURE 7.3 
A diagram of a Van de Graff accelerator. 


from gas ionized at high voltage) is sprayed or brushed onto a conducting 
conveyer belt, which continually rotates about two drums (somewhat like the 
fan belt in a car). One end of the belt is inside of a conducting dome. A wire 
brush is attached to the sphere and brushes against the belt. The charge on 
the belt will then travel through the brush and spread out on the sphere. The 
motors turning the drum provide the work needed to carry out this process. 
The points at which the charges are sprayed onto the belt are called corona 
points. 


To complete the accelerator an ion source must be placed within the con- 
ducting dome near an evacuated tube. This tube leaves the dome and provides 
a conduit for the accelerated particles to eventually hit a target. If positive 
ions are emitted from the source, they will be accelerated away from the 
positively charged dome down the tube and toward the target. The tube is 
constructed with equipotential metallic rings, embedded within an insulating 
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tube, and the entire structure is contained within a pressurized chamber of 
gas (typically SFg) of about 15 atm. The energy limit that the machine can 
reach is constrained by the voltage at which the gas discharges. 

Higher energies can be attained through the following trick. If negative ions 
are emitted from a source at one end of an acclerating tube that is outside of 
the dome, they will be attracted to it. A stripper inside the dome can remove 
some of their electrons and make them positive. These ions again accelerate 
down another tube, away from the terminal and toward a target. In this 
manner the maximal energy can be doubled to about 25 — 40 MeV. Such a 
machine is called a Tandem Van de Graff [65]. 

DC machines cannot achieve the requisite energies of modern particle physics. 
However, they have a high beam intensity (up to 100 microamperes) and a 
stable beam energy, and so are useful in nuclear physics research and (more 
recently) in solid state physics, where these machines are used to implant ions 
into materials to achieve a desired doping. 


7.2  Linacs 


Linear accelerators (LINACS) attempt to overcome the aforementioned limi- 
tation by giving the charged particle a series of *kicks" using an AC source. 
The basic idea is to repeatedly accelerate the particle many times over. A 
LINAC does this via a series of cylindrical tubes (called drift tubes), each of 
which is connected to a high-frequency oscillator. The succesive tubes are 
arranged to have opposite polarity. Inside the tubes the electric field is zero, 
but in the gaps in between, the electric field alternates with the generator 
frequency. 

Suppose a particle of charge e enters this setup. The electric field at the first 
gap is set so that it attracts the particle, accelerating it to the first tube. The 
length of this tube is arranged so that when the particle arrives at the next 
gap the relative voltages of the tubes have flipped so as to provide another 
accelerating field in the gap. This further accelerates the particle and the 
process is repeated up to the tolerance voltages of the device, as illustrated in 
fig. 7.4. Each tube must increase in length because the speed of the electron 
rapidly increases as it moves down the tube. The length L of the tube must 
equal ivT, where T is the period of the oscillation and v is the speed of the 
particle. 

Typically such machines gain a few MeV per meter in beam energy. Proton 
linacs typically reach about 50 MeV; the best in the world is the meson factory 
at Los Alamos that can reach 800 MeV. Electron linacs can reach much higher 
energies (~ 25 GeV maximum) since the electron is much lighter. The largest 
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A schematic drawing of a linac. 


electron linac in the world is at Stanford. It is 3 km long and has a 25 GeV 
electron beam which pulses 60 times per second. 


7.2.1 Focusing the Beam 


One of the problems with such big machines is in keeping the beam well- 
collimated. Light from a flashlight, say, will spread out, and the same kind 
of thing will happen with a particle beam. And, just as the spreading light 
can be refocused with lenses, the same sort of thing can be done for particle 
beams by using magnets*. The Lorentz force equation describes the motion 
of a particle of charge q in an electromagnetic field: 


F=a(8+ 1x8) 0x8 (7.2) 
C C 


where Y is the velocity of the particle and we have set E — 0. For &LB, the 

2 

mv? 
T 


particle will experience a centripetal force |F | = normal to both of these 


directions, and so 


mu MUC C 
pe pe 


—— 7.3 
T c qB qB 2) 


is the radius of curvature r through which the particle bends as it goes through 
the magnetic field. 

To get an idea of how big the magnetic fields are that are required for focus- 
ing, consider an electron whose kinetic energy Exin is 1 MeV. Its momentum 


*Electric fields could also be used, but the field strength required for focusing high-energy 
particle beams is impractically large. 
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Defocusing plane 


Focusing 
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FIGURE 7.5 
A schematic diagram of a quadrupole magnet. 


is therefore given by solving 


Erin = \ (pe)? + (m)? — mc? 2 pe = 142 MeV = 142 x 109eV — (7.4) 


since the mass of the electron is 0.511 MeV. Because the charge of the electron 
is 1.60217733 x 1071? Coulombs = 3 x 10?esu, we find from eq.(7.3) 
pe  142x 108 


EM CE O A NE 3 la 
Br= 7 3x 105 4.73 x 10° Gauss-cm (7.5) 


Unfortunately a magnet can bend particles only in one plane, and so can 
focus only in this plane, unlike an optical lens that can focus in more than 
one plane. How can a magnetic field be used to focus in two planes? The 
solution to this problem was found in 1950 by Christofilos [66] (and again in- 
dependently in 1952 by Courant, Livingston and Snyder [67]): the quadrupole 
magnet! This magnet focuses in one plane, and defocuses in the orthogonal 
plane — see figure 7.5 for a conceptual representation of how this works. In the 
diagram, particles in the horizontal plane are deflected inward, while those in 
the vertical plane are deflected outward. Rotate this magnet by 90° and the 
opposite effect occurs: particles in the horizontal plane are deflected outward, 
and those in the vertical plane are deflected inward. Consequently two such 
magnets rotated 90° to one another around the beam axis behave as an op- 
tical lens, and a net focusing occurs. A picture of a quadrupole magnet that 
was used for PEP (Positron-Electron Project) at SLAC is shown in 7.6. 

Fig. 7.7 is a diagram of the SLAC linac, which is the world’s highest energy 
LINAC. 

LINACs can achieve arbitrarily high energies in principle. However, the 
greater the desired energy, the larger the cost — a 500 GeV accelerator would 
have to be 75km long! This is prohibitively expensive and environmentally 
costly. A new solution for achieving higher energies is required. 
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Prototypes of the magnets used in the 
synchrotron storage ring, The large 
yellow one forces electrons to turn a 
corner, creating high intensity light. 


The smaller green and red magnets help 


focus and steer tho electrons. 


FIGURE 7.6 


A quadrupole magnet once used in the storage ring at the Australian Syn- 
chrotron, Clayton, Victoria, Australia. Photo by John O’Neill; used with 
permission. 
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FIGURE 7.7 


A diagram of the LINAC at SLAC. Image courtesy of SLAC National Accel- 
erator Laboratory. 


7.3 Synchrotrons 


The idea of the synchrotron is to use one voltage source to repeatedly acceler- 
ate the particle instead of many sources as in the linac. Instead of the particles 
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moving in a straight line, they move around in a circle, their motion synchro- 
nized with a changing voltage source and magnetic field that increases their 
energy. This idea of circularly accelerating particles was proposed in 1930 by 
Lawrence [68], and such machines were called cyclotrons. 

A cyclotron consists of two hollowed out metal vaccum chambers, each in 
the shape of the letter D. These are placed side-by-side along their straight 
edges with a gap, and each is connected to an alternating high voltage source. 
This entire setup is then placed inside a magnetic field that is perpendicular 
to the D-shaped chambers. The high-voltage source produces an electric field 
only in the gap between the D’s because the metal chambers shield the in- 
sides, where only the magnetic field pervades. If an ion source is placed in the 
gap, the electric field will accelerate ions toward one of the D-chambers. The 
magnetic field meanwhile causes the ion to move in circular motion. By ap- 
propriately setting the alternating frequency of the voltage source, the ion, as 
it leaves the first D-chamber, will be accelerated toward the second one. This 
process can be repeated many times, with the ion being accelerated across 
the gap each time it leaves a D-chamber. Its speed and its radial orbit will 
continue to get larger until one wishes to extract it (say by turning off the 
magnetic field) to have it impact upon a target. 

A fixed-frequency cyclotron cannot accelerate particles to high energies, 
where relativistic effects must be taken into account. The maximum energy 
a proton cyclotron can obtain is 20 MeV. A more sophisticated machine is 
needed to attain higher energies. These machines are called synchrotrons. 

A synchrotron consists of straight segments in accelerating cavities com- 
bined with circular segments that cause the particle to repeatedly traverse 
the same trajectory. The charged particles are first linearly accelerated into 
the ring, and then traverse a vacuum tube in a torus. A magnetic field keeps 
the particles moving in a circle. The straight segments have a RF field that 
turns on as the particles enter the cavity, accelerating them to higher speeds. 

For a particle injected into a ring of radius R at speed v, the time for one 
full turn is 

27R | 2rR 


T = 
U pe? 


€ (7.6) 


since p = myv and € = myc’. Hence the circular frequency is 


2m pæ 
= — = T.T 
T ER (1-7) 
and the acceleration of the particle is 
JUNE INE 
— = -0x B 7.8 
dt c. à D 
where the magnetic field is orthogonal to the direction of motion. Recalling 
that Y = v0 for circular motion, we have Y x B = —vBf. Similarly, we have 


dp dp, dÊ dp; 
aE fot rs, S ogg (7.9) 
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which for constant p = |p] yields 


EU E — SoBe. (7.10) 
or alternatively 
pc 
B= — 7.11 
qR ( ) 


Of course p is not constant: once per revolution, the particles are accel- 
erated to increase v. Obviously B must increase in a synchronized manner 
(otherwise the particles will crash into the walls of the tube) — hence the name 
synchrotron. As the particles are accelerated by an RF generator of frequency 
w, they will gain energy and momentum. The frequency w must be an integer 
multiple of (2 to keep the beam within the tube. As the energy increases, we 
have pc — E and so 


2 k 
Perse Liu and LL N 


= A 12 
ER R up tam 61) 


since pc — € at high energies. The magnetic field and the RF are increased 
from their initial values to final values chosen in such a manner as to always 
maintain the above relations. Clearly the limitation on the beam energy is B. 
'The best superconducting magnets currently furnish magnetic fields slightly 
larger than 5 Tesla (50 kilogauss). 

A notable feature of large acclerators is that the particles cannot be accel- 
erated from zero (or small) velocity into the machines — the range over which 
the RF and magnetic fields would have to operate is too big. Consequently 
such machines are built in stages, with smaller machines pre-acclerating the 
particles to speeds that the larger machines can handle. The Large Hadron 
Collider (LHC) at CERN is an excellent example of a machine that makes use 
of smaller linacs and synchrotrons to achieve high energies, as shown in fig. 
7.8. 


7.3.1 Focusing Beams at Synchrotrons 


Particle beams in general do not consist of streams of charged particles. In- 
stead, they occur in clusters, or bunches. The reason for this is that there is 
always some finite spread in the time of arrival of the particles as they enter 
an acceleration region. Consider a cyclotron. Particles arriving “on time” 
will experience just the right electric field to keep them moving in the correct 
orbit in the D-chambers. A particle arriving earlier will experience a stronger 
electric field, causing it to traverse an orbit of larger radius. This in turn 
causes it to return to the gap at a later time, closer to the return time of the 
original “on time” particles. A particle arriving later will experience a weaker 
electric field and thus traverse an orbit of smaller radius, causing it to return 
to the gap earlier, which again is closer to the return time of the original “on 
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time" particles. The second time around, the advanced particles will arrive 
less early and so be accelerated less, whereas the delayed particles will arrive 
less late and so be accelerated more. This continues for each orbit, resulting 
in the bunching of particles about the synchronous orbit. This bunching is 
the “RF structure" present in all such acclerators. 

Ideally, the particle bunches would move uniformly, all accelerate in step, 
and hit the target precisely. In practice, small misalignments of the beam, 
magnet inhomogeneities, etc. cause the beam (or rather the bunches) to wan- 
der. Deviations from the ideal circular path are called betatrom oscillations. 
With appropriate focusing using quadrupole magnet pairs (quadrupole dou- 
blets) as discussed above, these can be made quite small compared to the 
beam radius R. Longitudinal oscillations of a bunch are called synchrotron 
oscillations. Appropriate RF “kicking” stabilizes these bunches, which oscil- 
late in size around the ideal equilibrium position. A typical beam thickness is 
~ 1 mm. The concept of stabilizing the bunches via corrective field methods 
was developed independently by Vladimir Veksler [69] in 1944 and by Edwin 
McMillan [70] in 1945. 

Proton synchrotrons operate by first accelerating the protons in a Cockroft- 
Walton machine to about 1 MeV, after which they are further accelerated into 
a linac before injection into the synchrotron, typically at an energy of several 
hundred MeV. The magnets are positioned in a ring along the circular path 
of the beam line. The world's largest proton synchrotron beam is the LHC at 
CERN. 

Since electrons are lighter and so much easier to accelerate, why not build 
electron synchrotrons? This can be done, but there is a significant cost because 
all charged particles radiate energy as they accelerate. Circularly moving 
particles at higher and higher speeds accelerate more and more, losing energy 
AE with each revolution: 


An g2 9^4 An q? E 4 
A AN e ) vse (7.13) 


mc? 


where the formula for energy loss is given in Jackson (eq. (14.31)) [71]. Hence 
the energy loss varies inversely with the fourth power of the mass, and so 


(AE) oi Mproton V^ 
an = p ~ 1013 (7.14) 


proton Melectron 


which is a huge energy loss: at 20 GeV beam energy it is 16 MeV per turn. 
This makes electron synchrotrons a very expensive but excellent source of 
intense short-wavelength light, called synchrotron radiation. This emitted 
radiation permits unique research in a variety of scientific fields in physics 
chemistry and biology. The Canadian Light Source in Saskatoon, Canada, is 
a good example of a state-of-the-art machine of this type. However, for the 
purposes of particle physics, proton accelerators provide much more energy 
per unit cost. 
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7.4 Colliders 


All of the aforementioned machines are fized-target machines: the particle 
beam, after reaching its optimum energy, hits some target. Nearby detectors 
then measure what happens. This used to be the way that all high energy 
experiments were carried out. In the last 30 years, a new type of machine 
called a collider has become important. Here 2 beams are smashed into each 
other, with nearby detectors monitoring the collision. 


Why is this better than a fixed-target machine? Consider 2 particles with 
4 momenta ph and p" for the beam and target respectively. The total 4- 
momentum p^ is, by momentum conservation 


2 
p^ = pt + p! > p = (p, +p.) 
> p? = Py TW + 2p, * Pe 
> pli = mé + mt +E E, —P, Rc) (7.15) 


where m, and m, are the masses of the beam and target particles respectively. 


The center of momentum system (CMS) is defined to be that system in 
which p^ = (E,,,,0). If the target is at rest, then p, — 0. and so 


E? = mec! + mich + 2E,m,c? > Evora > V2E mc (7.16) 


for beam energies E, > m,c?, m,c?. However, if both the beam and the target 
are moving toward each other so that p, + pe = 0, then 

E2 a =m044+m2044+2E,E, + |. 2) > E Y 25, (7.17) 
for beam energies E, ~ E, > m,c?,m,c?. Hence in fixed target machines, 
the total energy increases as the square root of the beam energy, whereas for 
colliders it increases linearly with the beam energy. Clearly much more total 
energy is available for particle creation in colliders! 


Colliders have several disadvantages. The particles in the beam must be 
stable, unlike the previous machines we have considered, which can be used 
to produce secondary beams of unstable particles. Hence only protons, an- 
tiprotons, electrons and positrons can be used in colliders. The other (more 
serious) disadvantage is that the collision rate is low. The relationship between 
the rate R and the other parameters of the beam is: 


Nı N2 
A 


R=o0L with £ — fn 
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where the quantities associated with the luminosity £ are 


f = frequency of revolution of bunches 
n = number of bunches per beam 
N,, No = number of particles per bunch in beams 1,2 
A — area of beam 
c = cross-section for interaction 
(computed from underlying theory) 


For colliders, £ is typically ~ 10?! /cm?/s , whereas in fixed target machines 
L ~ 10%/cm?/s. This is a necessary trade-off between the two kinds of 
machine — attaining higher energies comes at the cost of reduced luminosity. 


AA ring injected bunch to center 
kicker of AA ring 


pickup 


cooled stacked 
bunch bunch 


FIGURE 7.9 

Schematic diagram of stochastic cooling. A pickup coil delivers a signal de- 
pending on deviation from the antiprotons from the ideal orbit, and this acti- 
vates a kicker which deflects them toward the ideal orbit as they come around 
the ring. After the bunch is cooled, it is kicked into the inner half of the 
chamber and stacked together with previous bunches. 


'The cross-section, denoted by c, is a quantity that is characteristic of the 
fundamental physics governing the interaction, and can be thought of as the 
effective area that one particle presents to another. We shall defer a discussion 
of its properties until chapter 9. 

Non particle-antiparticle machines (e.g. ep or pp) need 2 separate beam 
pipes and 2 sets of magnets. Particle-antiparticle machines (e* e^ or pp) need 
only one pipe and set of magnets. pp machines pose unique problems in that 
obtaining a beam of antiprotons is much harder than obtaining a positron 
beam. Antiprotons are produced from fixed target proton-nucleus collisions: 
these have a low yield and give a hot gas of antiprotons. 

To cool this gas a technique called stochastic cooling was developed by 
Simon van der Meer, illustrated in fig. 7.9. The antiprotons are placed in a 
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FIGURE 7.10 
A diagram of the facilities at the Fermilab Tevatron. Image courtesy of Fer- 
milab National Accelerator. 


ring. A sensitive pickup coil in one part of the ring senses the deviation from 
an ideal path and sends a signal to a kicker in another part of the ring to 
deflect them to an ideal orbit. It takes about 2 seconds of circulation to cool 
the injected beam. 

After cooling, the injected bunch is put into a stacking ring in the same 
tube. After a day or so about a trillion antiprotons exist in the beam and 
can be used for experimentation. This method was used in the CERN SPS 
collider to obtain proton-antiproton collisions which led to the discovery of 
the W and Z bosons [73]. 

The highest-energy machine currently operative is the Tevatron at Fermilab. 
A schematic illustration of the setup at Fermilab is shown in fig. 7.10. The 
TEVATRON at Fermilab is 1 km in radius and can achieve a beam energy 
of 1000 GeV — 1 TeV (tera electron volt). It is a pp machine, which uses a 
Cockroft- Walton, a Linac and a booster ring in order to get the protons to 
sufficiently high energy before they enter the main ring. 

One machine that played a very important role in experimental particle 
physics in the 1990s was called LEP (for Large Electron-Positron machine). 
This machine was designed to run at collision energies (about 90-100 GeV) 
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that would allow the Z-boson to be produced in large numbers. This worked 
very well, and an enormous amount of empirical information was obtained 
about the Standard Model from the experiments done at this facility. In 
April 2000 the energy of LEP's particle beams was boosted to be as high as 
possible — up to 209 GeV, well beyond the original design energy. Significant 
experimental data was accumulated at a center-of-mass energy in excess of 
206 GeV, and a number of events compatible with a Higgs boson production 
with mass around 114-115 GeV were reported in the combined results of the 
four LEP experiments, ALEPH, DELPHI, L3 and OPAL. Unfortunately the 
topology of these events is also compatible with those originating from other 
known Standard Model processes, and so it is at present impossible either to 
rule out or confirm the existence of a 114-115 GeV Higgs boson [74]. LEP 
was shut down several years ago so that its facilities could be renovated to 
convert it to the LHC. 
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FIGURE 7.11 
The four main experiments and the two ring structure of the LHC (copyright 
CERN; used with permission). 


Higgs bosons of such a light mass are expected to be copiously produced 
at the LHC, the centerpiece of the future scientific program at CERN. The 
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LHC is housed in a circular tunnel 27 km in circumference, buried at a depth 
of 50 to 175 m underground on the Swiss/French border on the outskirts of 
Geneva, with a planned start-up in 2008. This machine will achieve energies 
of 7 TeV+7 TeV for a proton-proton beam (the highest ever attained), and 
luminosities of 10?" /cm? /s. For most of the ring, the beams travel in two sep- 
arate vacuum pipes in opposite directions, but at four points they collide in the 
hearts of the main experiments, known by their acronyms: ALICE (designed 
to see if it is possible to make and detect a quark-gluon plasma), ATLAS 
(whose purpose is to find the Higgs), CMS (which will look for evidence of 
supersymmetry), and LHCb (whose purpose is to make precise measurements 
of CP-violation in the b-quark sector). A conceptual diagram of the LHC at 
complex appears in fig. 7.11, and the tables 7.1, 7.2, and 7.3 should help you 
to keep all the acronyms straight. 


TABLE 7.1 

Accelerator Acronyms 
AAC Antiproton Accumulator Complex (LHC) 
AGS Alternating Gradient Synchrotron (Brookhaven) 
CESR Cornell Electron Storage Ring 
CLIC Compact LInear Collider (proposed CERN) 
DAFNE/DAPHNE Double Annular Factory for Nice Experiments 
EPA CERN’s Electron Positron Accumulator 
FMI Fermilab Main Injector 
FNAL Fermi National Accelerator Laboratory 
HERA Hadron-Electron Ring Accelerator (DESY) 
KEK B-Factory CP-violation in the B meson (KEK) 
LEAR/LEIR Low Energy Ion Ring 
LEP Large Electron Positron collider 
LHC Large Hadron Collider 
LIL Lep Injector Linac 
PEP Positron Electron Project (SLAC) 
PSB Proton Synchrotron Booster (CERN) 
PS Proton Synchrotron (CERN) 
RHIC Relativistic Heavy Ion Collider (Brookhaven) 
SLAC Stanford Linear Acclerator 
SPEAR Stanford Positron Electron Accelerating Ring 
SPS Super Proton Synchrotron 


Tevatron Fermilab’s 2-TeV proton-antiproton accelerator 
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TABLE 7.2 
LaboratoryAcronyms 
ANL Argonne National Laboratory, (Argonne, Illinois, USA) 
BNL Brookhaven National Laboratory (Upton, Long Island, USA) 
CERN Originally “Conseil Europenne pour Recherches Nuclaires" 
now European Laboratory for Particle Physics (Geneva, Switzerland) 
CLS Canadian Light Source (Saskatoon, Saskatchewan, Canada) 
DESY Deutches Elektronen SYnchrotron laboratory (Hamburg, Germany) 
FNAL Fermi National Accelerator Laboratory (Batavia, Illinois, USA) 
KEK Koo Energy Ken (Tsukuba, Japan) 
LNF Laboratori Nazionali di Frascati (Rome, Italy) 
SLAC Stanford Linear Accelerator Center (Palo Alto, California) 
SNO Sudbury Neutrino Observatory (Sudbury, Ontario, Canada) 


TRIUMF  TRIUniversity Meson Facility (Vancouver, British Columbia, Canada) 
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FIGURE 7.12 
The proposed CLIC facility (copyright CERN; used with permission). 


7.5 The Future of Accelerators 


The most important parameters of any accelerator are its final energy and its 
beam intensity. The energy is proportional to the magnetic field for a given 
radius of curvature, and so our ability to generate strong magnetic fields is 
a limiting factor here. With a typical iron magnet the field achieved can be 
about 20 kG (or 2 Tesla), whereas a superconducting magnet can achieve 
fields of 50kG (5 Tesla). These are now being used at Fermilab and the LHC. 
As far as bean intensity is concerned, stochastic cooling techniques can be 
used to ensure that this is under control. 


However, neither is enough to go much beyond the energies of the LHC. 
The key reason is cost, which runs into the 10s of billions of dollars. New tech- 
nologies must therefore be explored that will raise the energy threshold, and 
this is currently under study. One example is the Compact LInear Collider, 
or CLIC facility (see fig. 7.12). The basic idea of CLIC is to point two linacs 
toward each other, and to increase the beam energy to 1-3 TeV in each beam 
using an RF source of 30 GHz, for a total CMS energy of 2-5 TeV. If this 
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could be implemented, it would be the highest-energy particle acclerator in 
the world [75]. 


7.6 Questions 


1. Plans are made to upgrade TRIUMF in Vancouver so that it will be 
a 60GeV synchrotron with a high intensity (200 yA) beam of protons 
that can be used to produce several new kinds of subatomic particle 
beams (e.g. K s, TS, ns etc.). However, these beams are difficult 
to focus: the number of betatron oscillations Av varies as N/ (8??) 
where @ and y are the relativistic speed and energy parameters and N 
is the accelerable charge per pulse. A typical 60 GeV synchrotron has 
an injection energy of 300 MeV in its first stages. 


(a) How much greater must the injection energy be to increase the use- 
able beam current by a factor of 10 without defocusing? 


(b) How does this answer change if a 596 increase in defocusing can be 
tolerated? 


2. In the LHC two beams of protons will collide head on, each with energies 
of 7 TeV. 


(a) How does this energy compare to that of a typical cosmic-ray proton? 


(b) How much energy must a single proton beam have to yield the same 
CM energy on a fixed target of hydrogen? 


3. (a) A proton beam of kinetic energy 20 MeV enters a dipole magnet 2m 
in length. How strong must the field be to deflect the beam by 10°? 


(b) Suppose now the beam has 200 GeV of kinetic energy. How much 
deflection will be induced by a magnetic field of 25 kG? 


4. What is the minimum energy needed to produce antiprotons from the 
collision of two protons? Remember that conservation laws must be 
respected in the production process. 


5. (a) Two proton storage rings with a beam currents of 20 A each are 
directed to collide into each other. The interaction region has an area 
of 1 cm? and is 5 cm long. The relative velocity of the two beams is 
approximately the speed of light. How many collisions take place per 
second? 


(b) Now consider a proton beam colliding with a fixed target of liquid 
hydrogen that is 5 cm long and 1 cm? in area. How much current would 
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have to be in the beam for it to have the same collision rate as in part 


(a)? 


'The largest magnetic field possible in a superconducting magnet is about 
30 Tesla. What is the largest energy that protons can be accelerated to, 
in an accelerator that circles the Earth's equator? 


What would the energy loss per turn be for the acclerator in question 


#6? 


8 


Detectors 


DOI: 10.1201/9781420083002-8 


In a number of ways detectors are more fundamental in expanding our knowl- 
edge of particle physics than accelerators are [76]. Without particle acclera- 
tors we would still be able to learn much about particle physics by making 
detectors sensitive enough to probe naturally occuring cosmic rays, neutrinos, 
photons, and other subatomic particles. Of course an accelerator gives us 
crucial control over the energy of a given experiment, and its importance to 
particle physics cannot be understated. Yet without the proper accompanying 
detectors, accelerators are useless. 

After the collision of a particle beam with some target (or another beam) 
detection of what happens becomes the key task. In order to do this, the 
particle must leave some imprint of its presence, which is made possible by 
the fact that particles ultimately transfer energy to the medium they are 
traversing — if not, we’d never observe them! The construction and design of 
detectors depends on exploiting this property of energy transfer. There are 
many ways that this can happen, and we will begin by looking at the various 
possibilities. 


8.1 Energy Transfer and Deposition 
8.1.1 Charged Particles 


When a charged particle moves through a medium it will interact with the 
fundamental constituents of that medium: its nuclei and electrons. It can lose 
energy via three basic means: ionization, coulomb scattering, and radiation. 
Its primary interactions are with the atomic electrons of the medium, and 
this forms the dominant mode of energy loss*. The reason for this is that 
scattering from nuclei causes large changes in the momentum of the incoming 
particle but relatively small changes in its energy, whereas scattering from 
atomic electrons (or ionization of nuclei) is an inelastic process that entails 


*The incoming particle can also have a direct collision with a nucleus, but this is an ex- 
tremely rare process, and so can be neglected. 
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substantively more energy transfer. At very high energies, the dominant mode 
of energy loss is radiation if the incoming particle is an electron or a positron; 
for more massive particles this process is negligible. 

As a charged particle moves through a medium, we expect it lose energy 
through its interaction with the medium. The key quantity of interest is the 
rate (4E) of energy loss per unit path length that the particle traverses whilst 
in the medium, a quantity called the stopping power of the medium. Once we 
know what it is, we can compute the range R, that any particle will have in 


the medium: 5 x i 
dEN 
r= [ do= | (=) dE (8.1) 
0 0 dx 


The stopping power can be calculated for the different ways that the particle 
can lose energy. Let’s look at these. 


8.1.1.1 Ionization Loss 


At relativistic speeds v = 8c, with Lorentz factor y = (1 — gy A the energy 
loss of a charged particle as a function of distance can be reliably calculated 
from our knowledge of electromagnetism, and the formula! 


dE An N. 22 (Z 2m,c? 8? 
Cm) uc A) 


has been verified experimentally over a wide range of energies for different 
kinds of particles and media. This formula was derived by Hans Bethe and 
Felix Bloch [77], and describes the mean rate of energy loss of a particle of 
charge q = ze due to ionization of a medium with ionization potential J, 
atomic number Z and nucleon number A. Here e is the charge on the electron 
(mass me) and Na —10??mo1^! is Avagadro's number. Most of the energy 
loss is due to formation of ion pairs, either by the particle itself, or by the 
electrons in the ion pairs causing further ionization. The total number of ion 
pairs is proportional to the energy loss of the incident particle. 

The energy loss most strongly depends on the incident velocity @ and so 
can be used to evaluate this quantity. For small 8 we have 


dE An NA(zee? ( Z 2m,c? 9? 
Gamer (Com Jp — de» 


and we see that 


dE 
d; 90 n8 e 87? e (Mfp) (8.4) 


where M is the mass of the incident particle (to see this, recall that p — 
M^v = Moy); particles of the same momenta but different mass will have 


Derivation of this formula is beyond the scope of this text [78]. I have included it here to 
impress upon you that this energy loss can be quantitatively computed. 
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different energy loss. The energy loss decreases with increasing particle ve- 
locity, reaching a minimum value before growing as In(y?). Note that at very 
low energies the stopping power becomes negative. This unphysical value in- 
dicates a breakdown in the above formula, since a low-energy incident particle 
can now capture electrons from the medium and form its own atomic systems. 
Other approximations can be made at lower and higher energies that provide a 
good description of the stopping power over a very wide range of momentum, 
as illustrated in figure 8.1. 

At high energies eventually long-range interatomic screening effects (ignored 
in (8.2)) make £P approach a constant value, and it becomes impossible to 
distinguish particle types based on ionization loss. For solid media the increase 
is very slight, but for gaseous media there is a rapid increase due to relativistic 
effects before the plateau is reached. For about 2 orders of magnitude above 


the ionization minimum, YE is about the same for all materials, having a value 


E 
(=) ~ —2pMeV /g/cm? (8.5) 
L i ization 


where p is the density of the absorber. 
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Diagram of the stopping power for — dE /dx antimuons in copper as a function 
of momentum, plotted in terms of Gy = p/Mc. The solid curves indicate the 
total stopping power, and vertical bands indicate boundaries between different 
approximations. Image courtesy of the Particle Data Group [1]. 
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8.1.1.2 Straggling and Scattering 


Electromagnetism, being long range, will tend to scatter incident charged 
dE 1 


particles from those in the medium (electrons and nuclei) Since $7 ~ 37» 
only for M — m, (i.e., electrons and positrons) is this energy loss appreciable. 
Energy loss of incident nuclei due to electromagnetic interactions is negligible. 
This kind of energy loss is called straggling and occurs because the energy that 
the incident particles transfer to the medium (i.e., the detector) is variable. 
Sometimes a lot will be transferred to the particle, and other times only a little 
energy will be transferred. If we know the functional form that describes the 
energy transfer, we can compute the mean energy loss and its dispersion. 
For example, suppose that the cross-section describing energy transfer to a 


given detector is 


do 2/32 

ig 7° rgo (8.6) 
where q? is the square of the momentum transferred to the detector from 
the beam and A is a constant characterizing the detector. This formula 
approximately describes the scattering of nucleons off of nuclei of radius A^! 
at low energies. If the detector is made of particles of mass M and the incident 
(beam) particles are of mass m, then conservation of 4-momentum implies 


H H — pe Ip 
Pheam + Daetector EM DPocam n Daetector (8.7) 


and the transfer of momentum (as we shall see in Chapter 13) is defined to be 


a = (phus — Phan) = — (plis — Phetector) + Tf the detector is initially 


at rest, then we have 


e =- [(E'/c- Me)? - gr 


2 [ce Jo Ig! — 28M y Mie” 


=2M (E' - Mc’) (8.8) 


where Peco = (E'/c,p). Hence the kinetic energy transferred to the de- 


tector is 
2 q? 
T=E' — Me = —— ; 
E 2M (89) 


and we can compute its mean and its dispersion. We find 


ua deb - dx dq! fe 96 /* A? 


T - 8.10 
us c fo dge 80” 16M m 
2 
scd AP (Ar) em ua 
T?) = = 8.11 
E JE dg?e-8e i» 128M? ib 
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using fj. dzz"e-F* = n!/k"*", and so the dispersion is 


A? 
~ 16M 


AT = ((T- ay = \/(T?) - (Ty (8.12) 
which in this example happens to equal the mean energy loss. 

The detector, predominantly made of nuclei, will have proportional to 
the inverse mean radius of the nucleus, which goes like 1.24*/%, where A is 
the atomic weight. This is almost proportional to the mass and so 


1 


-1 
A — (2045/5) GeV (8.13) 
16A (1.241/3) 
which is about 0.05 GeV = 50 MeV if the target is protons, but about 7 KeV 
if the target is lead! 
Scattering due to the coulomb field of the nucleus is another effect that 
needs to be taken into account. This effect limits the precision with which 
the direction of a particle can be measured. The Rutherford formula? 


do (Q? 1 
do (<2) sin* (0/2) P) 


indicates that the cross section c (0) is large for small 9, and so the probability 
for small deflections is high. Hence in traversing a material, the net scattering 
is the sum total of a large number of small deviations. 

So multiple Coulomb scattering yields a distribution in the net scattering 
angle. This distribution is approximately Gaussian. The root mean square 
of the angle is approximately 


L 
(9) aa = 20MeV vacas (8.15) 


where the incident particles have charge q, velocity Gc, and momentum p as 
they pass through a substance of thickness L and radiation length Xo. This 
last quantity is one we shall now consider. 


8.1.1.3 Radiation Loss 


Charged particles radiate electromagnetic energy (photons) when they accel- 
erate or decelerate. Electrons, being so light, are particularly susceptible to 
this form of radiation loss or bremsstrahlung, or braking radiation as it is 
sometimes called. We saw that the power loss in synchrotron radiation varies 


“Beginning with Chapter 9 we will learn how to calculate this formula and others like it. For 
now you can regard the integral of the right-hand-side of eq. (8.14) as giving the effective 
area a given nucleus presents to the incoming charged particle. 
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like (E/m)*, so only for small mass particles is this effect appreciable. The 
bremsstrahlung energy loss per unit length in a given medium is 


dE E 
B =-2 EE. Xo 8.16 
ae Xo M ote) 


where Xo is a constant characteristic of the medium called the radiation length. 
For fast moving electrons this form of energy loss dominates over ionization 
loss, since as G —^ 1 


dE 1 s dE ¿ 1 (8.17) 
PE mm au . 
da: brem V F= ps H da: ionization y V 1— p7 


Empirically the ratio of these two types of energy loss is approximately 


dE 
(a) men ES ZE : eas 
re 1200Mc 


where Z is the atomic number of the medium and M the rest mass of the 
particle entering the medium. The critical energy E. is that energy for which 
(£2)... ~ ($2). uv From the preceding equation we deduce 

600 


if electrons are the particles entering the medium. 
Table 8.1 lists values of Xy for various materials. 


TABLE 8.1 
Radiation Length for Various Materials 


Material Z Density (g/cm?) Critical Energy (Mev) Xo (cm) 


Liquid Ha 1 0.071 340 887 
Liquid He 2 0.125 220 745 
C 6 1.5 103 28 
Al 13 2.7 4T 9.0 
Fe 26 7.87 24 1.77 
Pb 82 11.35 6.9 0.56 
Air 0.0012 83 30870 


Water 1.0 93 36.4 


8.1.2 Photons 


Even though they are electromagnetically neutral, photons are carriers of the 
electromagnetic force, and so can experience interactions with any medium. 
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In particular, high energy photons (X-rays or y-rays ) traveling through a 
medium will interact with the atoms of that medium and so lose energy [79]. 
The change of photon intensity Z per unit thickness of material is proportional 
to the intensity and so 


di 
(=) = pI > gahe (8.20) 


where p is called the effective absorption coefficient of the medium. The value 
of 7! is the mean free path for absorption of a photon, or the average distance 
through which a beam of photons will pass before its intensity is 1/e of its 
original value. 
The total absorption coefficient y is the sum of the absorption coefficients 
for each process: 
H = Mpnoto F Mcompton F Mpair (8.21) 


which signify the three ways that photons can lose energy in a medium. It is 
proportional to the scattering cross-section, as we shall see in chapter 9. Fig. 
8.2 illustrates the relative importance of these three processes (plus a couple 
of other sub-dominant ones) as a function of energy. Let’s briefly consider 
each. 


8.1.2.1 Photoelectric Effect 


This phenomenon refers to the absorption of a photon by a bound electron, 
which subsequently is ejected from its atom. If the bound electron is in an 
inner shell of the atom then one of the outer electrons will move to occupy 
this lower and more stable energy level by emitting an X-ray photon of the 
appropriate frequency for this transition. Hence the emitted electron can be 
accompanied by an X-ray photon. 

Since the energy for ejection is a chemical energy, this process dominates 
at low energies. Computation of the cross sections involves details associated 
with quantum electrodynamics and atomic physics. Setting M = Me to be 
the mass of the electron, experimentally the behavior is observed to be 


Ze 2 
T for E<mec 
~l ETP e 
Zphotoelec X i z for E > mee (8.22) 


additionally illustrating the importance of this process for high-Z atoms. 


8.1.2.2 Compton Scattering 


At intermediate energies the incoming photon scatters off of electrons. It is 
not energetic enough to produce pairs, but is too energetic to eject electrons 
from atoms. Instead it scatters off on an electron, analogous to the manner 
in which two billiard balls scatter off one another classically. This process is 
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called the Compton Effect. We have already calculated the kinematics in 
Chapter 2, where we found in eq. (2.39) that 


E 
E'— 8.23 
1+ —£,(1— cos 0) ied) 


mec? 


where E is the energy of the incoming photon that is scattered into a photon 
of energy E' at an angle 0 relative to its original direction. The electron is 
initially assumed to be free, a good approximation if the incoming photons 
have enough energy so that atomic binding energies can be neglected. It is 
clear that E’ « E and so the scattered photon loses energy. The cross-section 


scales as 
Z 


7Compton ~ F (8.24) 


and dominates from photon energies in the range 0.1 to 10 MeV. 


8.1.2.3 Pair Production 


In pair production a photon is converted into an electron-positron pair. In 
free-space this is impossible because energy and momentum can’t be con- 
served. The reason for this is as follows. Suppose the photon has a small but 
nonzero mass. If a single photon could produce a pair of particles, then the 
total (rest) mass of the particles would have to be smaller than the photon 
mass, since we can always go to a frame in which the photon is at rest and 
for which the total energy is its rest mass. This would mean that the photon 
could only produce pairs of oppositely electromagnetically charged particles 
that were lighter than half of its mass. Empirically we know that electrons 
and positrons (the lightest electromagnetically charged particles) are much 
heavier than the photon; in fact we will later see that we have very good 
reasons to expect the photon mass to be zero! 

So a free photon cannot decay into (or rather produce) pairs. However, a 
photon traversing the Coulomb field of a nucleus can, because the recoil of the 
nucleus can balance energy and momentum. The threshold energy for pair 
production is twice the electron (or positron) mass, which is 1.022 MeV. The 
cross-section behaves as d 

c 


25 f z? (8.25) 


and we see that it will dominate for high energy photons. Dominance of 
energy loss due to pair-production sets in for photon energies greater than 10 
MeV. 

What happens to the positrons that are produced? Along with the produced 
electrons, they will traverse the medium and lose energy via the mechanisms 
discussed above for charged particles. At sufficiently low energies they will 
capture an electron in the medium and bind to it to form positronium. As 
we saw in Chapter 6, positronium is highly unstable, decaying into a pair of 
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FIGURE 8.2 

Absorption cross-sections of photons as a function of energy in carbon and 
lead. The different contributions are of the photoelectric effect (0,...), Comp- 
ton scattering Ocompton, and pair production (Knuc, due to the nuclear elec- 
tromagnetic field, and ke, the electron field) are shown. Two other relatively 
small effects not considered in the text — Rayleigh scattering (Opayicign in which 
the atom is neither excited nor ionized), and Photonuclear interactions (o, .., 
in which the target nucleus is broken up) — are also illustrated. Image courtesy 
of the Particle Data Group [1]. 
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photons in about 10-!? seconds, with each photon from the produced pair 
having 0.511 MeV. This provides a way of both detecting positrons and of 
calibrating detectors at low energies. 


8.2 Detector Types 


Detection requirements are exacting in particle physics. The key quantities 
of interest that one would like to measure are the position (or rather trajec- 
tory), momentum, energy, arrival time, charge, spin, and any other relevant 
quantum numbers associated with the identity of the particle. In addition, 
it is important to know which particles are associated with each other in a 
given reaction. No one detector can do all this, so in practice combinations 
of detectors are used to extract the information required. 

In a fixed target machine the detector is typically placed behind the target, 
and the geometrical arrangement is shown in fig. 8.3. However, in a collider 
the detector must be placed around the target (fig. 8.4) , in order to maximize 
the amount of information obtained from the collisions. In general, each 
detector consists of a layered array of materials, with each layer sensitive to 
detection of a particular kind of particle. 


0! 
o — ON detector 


particle target 


FIGURE 8.3 

Geometry of detectors in a fixed-target experiment. The different layers corre- 
spond to different kinds of detection materials. Image courtesy of the Particle 
Data Group [1]. 


8.2.1 Scintillation Counters 


Scintillation counters were the earliest counters invented: called spinthari- 
scopes, the first one was made in 1903 by Crookes [80] and consisted of a ZnS 
screen that would flash whenever an alpha-particle hit it. The detector was 
the human eye and the recorder was pen and paper. Geiger and Marsden [81] 
used two screens to perform the first coincidence experiment in 1910, checking 
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FIGURE 8.4 

Geometry of detectors in a collider experiment. The different layers corre- 
spond to different kinds of detection materials. Image courtesy of the Particle 
Data Group [1]. 


to see if two alpha particles were emitted by a radioactive gas within a given 
“short” time. The unreliability of the human eye as a detector led to the 
abandonment of these devices. 


In 1944 these detectors were reintroduced, with a photomultiplier replacing 
the eye. The basic arrangement for such a detector appears in fig. 8.5. When 
a charged particle passes through the scintillator it excites the medium inside; 
as the medium de-excites photons are emitted, providing information about 
the particle's trajectory. It was subsequently discovered that mixtures of 
organic liquids and aromatic molecules yielded high numbers of photons per 
unit loss of energy (about 10,000 photons per MeV) [82]. So the most common 
materials used for the medium in today's scintillators are organic liquids and 
plastics (which emit UV light, which is then converted to blue light via dye 
molecules) and inorganic solids (such as sodium iodide, whose dopants capture 
electron-hole pairs and then emit visible light). 


Photomultiplier tubes use the photoelectric effect to record the photons 
emitted from the medium. These photons liberate electrons at a photocathode 
(whose conversion efficiency is about 2596), which then travel down a channel 
of increasing potential electrodes (the dynodes) which emit more electrons 
causing amplification. There are typically 6-14 dynodes yielding an amplifi- 
cation factor of about 104 to 109. Sodium iodide doped with thallium has a 
high detection efficiency but a slow decay time (about 250 nanoseconds) for 
each pulse. Plastic scintillators (anthrancence or napthalene) can shorten the 
pulse decay time to 10 nanoseconds (with a resolution of about 200 picosec- 
onds) and so such detectors are ideal triggering devices — in other words their 
signal is used to decide whether or not to activate the rest of the detector 
and/or to record the event. However, their efficiency is low. 


The signal that emerges is passed through a discriminator, whose function 
is to eliminate random noise pulses emitted through thermal electron emis- 
sions from the cathode and the dynodes. The final signal is not sharp, but 
instead has a shape that is a consequence of the experimental resolution of 
the detector. This signal is sent to a pulse-height analyzer, which digitizes 
and displays the pulses. 
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FIGURE 8.5 
Schematic diagram of a Scintillation counter. 


8.2.2 Cloud Chambers 


Cloud chambers were first developed by Charles T.R. Wilson around 1911 
for experiments on the formation of rain clouds [83]. The cloud chamber is 
a sealed environment containing a supercooled, supersaturated water vapor. 
When a charged particle interacts with the mixture, it ionizes it. The result- 
ing ions act as condensation nuclei, around which a mist forms because the 
mixture is at the point of condensation. The high energies of the incoming 
particles mean that a trail is left, due to many ions being produced along 
the path of the charged particle. These tracks have distinctive shapes. For 
example an alpha particle’s track is broad and straight, while an electron’s is 
thinner and shows more evidence of deflection. Anderson made use of a cloud 
chamber in 1933 in his discovery of the positron [84]. 


8.2.3 Bubble Chambers 


These descendants of the cloud chamber were invented by Glaser in 1952 
[85]. Unlike scintillation counters, which yield only information about the 
energy of the particle passing through, bubble chambers look at all possible 
processes/particles passing through them. 

They consist of a superheated liquid (one whose pressure is lower than its 
equilibrium vapor pressure, i.e. “hotter than its boiling point” ) placed in some 
(relatively thin) container. As an ionizing particle passes through, the liquid 
will start boiling by forming bubbles at nucleation centers. The superheated 
liquid is prepared by starting with the very cold liquid under pressure (about 
5 atmospheres and 3K) and then, just before the particle beam arrives, the 
pressure is reduced suddenly by using a piston to expand the volume by about 
1%. 

Ions form good nucleation centers, and so a charged particle moving through 
this detector will leave a trail of bubbles. These bubbles are then pho- 
tographed, allowing one to directly trace the actual track(s) of the ionizing 


Detectors 147 


FIGURE 8.6 

An important bubble chamber photograph. A K” impacts upon a proton in 
the reaction K^ + p => 07 + K+ + K?. This photograph (and others like it) 
provided evidence for the QT particle, whose discovery gave strong support 
to the quark model. Photo courtesy of Brookhaven National Laboratory. 


particle(s). In conjunction with a strong magnetic field (which curves the 
tracks provided the field is orthogonal to the plane of the thin container), the 
momentum can be deduced from the track curvature. An example from an 
actual experiment is shown in fig. 8.6. 


Bubble chambers are very useful in studying complicated interactions of 
many particles. The first bubble chambers contained only a few cc's of liquid, 
but the volume rapidly increased to be about a million times larger as new 
ones were built. The best ones produced 35 million photographs per year. 
Unfortunately they cannot be used in colliders because of the interaction 
geometry of the beams. They are also not selective because they cannot 
be triggered — just as a surveillance camera at a bank photographs every 
visitor, a bubble chamber records everything that goes through it, and so 
every picture must be developed and scanned to see if there is any useful 
information present. 
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8.2.4 Spark Chambers 


Short high voltage pulses (10-50 MV/cm) between parallel plate electrodes 
enclosing a gas will yield short electrical breakdowns in the structure of the 
gas. An ion trail will then leave “flashes” or “sparks” of light due to such 
discharges, as Greinacher found in 1931: the passage of an alpha particle 
through such a system generated a spark between the wire and the plate, 
allowing for a direct track image to be photographed with good spatial reso- 
lution. Like a bubble chamber, events must be recorded by some means and 
evaluated later. However, spark chambers operate much more rapidly and can 
be made highly selective by using auxiliary detectors to screen out unwanted 
events. Because of its selectivity, the spark chamber is most useful in search- 
ing for very rare events. They can be highly automated, with data collected 
and stored electronically instead of photographically, as is necessary with the 
bubble chamber. A high-speed computer, which may operate simultaneously 
with the experiment, can then be used to analyze the data. This provides im- 
mediate evaluation of the quality of the data and affords optimum operating 
conditions to be continuously maintained. Very large spark chambers have 
helped to detect neutrinos. A picture of cosmic rays flying through a spark 
chamber at CERN in 1999 appears in figure 8.7. 


8.2.5 Wire Chambers 


Wire chambers were developed by Georges Charpak in 1968 [87] and have 
excellent time resolution, very good position accuracy and are self-triggered. 
A schematic diagram outlining the basic features of a wire chamber appears 
in figure 8.8. These consist of a plane of positively charged wires precisely 
separated (typically by about 2 mm) from each other. The plane is in between 
two cathodes (often made of aluminum foil) that are about 1 cm away. This 
whole structure is encased and then filled with a gas, which ionizes when a 
charged particle passes through it. Upon ionization of the gas by a charged 
particle, electrons drift toward the nearest wire, gaining energy. If this energy 
exceeds the ionization energy of the gas the process will repeat, leading to 
an avalanche of ion pairs. The amplification factor is typically ~ 10°. Sand- 
wiching a set of these counters in concert (see fig. 8.9) allows for accurate 
measurement of position. 

These multiwire proportional counters (MWPC’s), when modified by an 
electric field, cause the ionized electrons to drift before getting to the high 
amplification region near the anode. By timing the arrival of the pulse relative 
to some external fast signal the drift distance can be obtained, yielding the 
position of the particle. The pulse height can be used to measure the mass 
of the particle causing the ionization. Hence the track of the incident particle 
can be reconstructed — with a precision of up to 100 microns! 

If a set of MWPC planes is placed on either side of a region that has an 
applied magnetic field, there will be a change in the trajectory of the particle 
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FIGURE 8.7 

Cosmic rays — created when subatomic particles from outer space collide with 
the atmosphere — can be seen in this photograph of a large spark chamber 
at CERN. Their tracks are visible as a series of sparks between metal plates 
(or planes of parallel wires) several millimeters apart, induced by the passage 
of each charged particle as it moves through the spark chamber (copyright 
CERN; used with permission). 


as it passes from the first set of planes to the second. This change provides 
information about the momentum of the particle, using the formula (7.5) 
pc = qBr, where r is the radius of the curved arc traced out by the changing 
trajectory of the particle of charge q, and B is the magnitude of the applied 
magnetic field. 


8.2.6 Time Projection Chambers 


Time projection chambers (TPCs) were invented in 1974 by David Nygren 
to give spatial resolution of particle tracks in all 3 dimensions over as large 
a solid angle as possible [88]. A drift chamber is filled with gas (usually 
some inexpensive mixture of Argon and CH4) and then uniform electric and 
magnetic fields are applied parallel to the beam axis. lon pairs are produced 
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FIGURE 8.8 
Schematic diagram of a wire counter. 


as the charged particle passes through the TPC. The electric field causes 
the electrons of the ion pairs to acclerate towards one end of the chamber, 
and the magnetic field causes the trajectories to become tiny spirals along the 
magnetic field axis, i.e., parallel to the beam axis. The point of impact of 
the electrons on the end caps yields the projection of the trajectory, giving 
two spatial coordinates. The arrival time of the electrons can be used to 
determine the third spatial coordinate. The total charge deposited at the 
ends gives the total ionization and hence the total energy lost by the original 
charged particle. Since the detectors surround the beam pipe completely, a 
large solid angle is covered. The large number of sensitive elements at each 
end allows observation of many particles simultaneously, thereby providing 
efficient pattern recognition of the event. 


8.2.7 Cerenkov Counters 


Whenever a charged particle moves at a speed faster than c/n (where n = 
index of refraction of the medium), a coherent wavefront forms as a cone of 
angle 0 about the trajectory as in fig. 8.12. Named after Pavel Cerenkov, who 
discovered this effect in 1934 [89], this radiation is called Cerenkov radiation 
and it appears as a continuous spectrum. Measurement of the angle 0 of 
the emitted light, as shown in fig. 8.12, provides a direct measurement of 
B = v/c. Note that if Gn < 1 no signal is observed [90]. 

This effect can also used to identify different particles of the same momenta. 
Consider protons, pions, and kaons, each moving with a momentum of 1 

-1/2 

GeV/c. From the relationship p = myv = mv (1 — 2 Í between velocity 
and momentum, the respective velocities of these particles are 0.73c, 0.99c, 
and 0.89c. Observation of each of these particles using a Cerenkov detector 
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FIGURE 8.9 

Georges Charpak (left) works with students on the multiwire proportional 
counter. Densely packed anode wires and cathode planes are enclosed in 
a gas-tight chamber. Each wire acts as an individual detector similar to a 
proportional counter. A particle leaves a trace of ions and electrons that drift 
toward the nearest wire, which then emits a pulse of current. Several planes 
of wires with different orientations are used to track the trajectory of the 
particle very accurately (copyright CERN; used with permission). 


requires media of differing refractive indices — in this case n must be larger 
than 1.37, 1.01, and 1.12 respectively. If we put two Cerenkov detectors in 
sequence — say one with water (where n = 1.33) and one with a gas where 
n = 1.03, then the pion will register a signal in both detectors, the kaon only 
in the water detector, and the proton will not register at all. 


Velocity resolutions of 1077 have been obtained using these kinds of de- 
tectors. This was the chief detection method used at the Sudbury Neutrino 
Observatory [91], where the medium is heavy water, with n — 1.32828. Neu- 
trinos in these detectors scatter electrons as they enter the water. As these 
electrons travel through the heavy water they do so at velocities larger than 
c/n, and so register a signal which is picked up by photomultiplier tubes. 


'The main limitation of Cerenkov detectors is that a very feeble signal is 
produced. In general the number of photons radiated per unit path length £ 
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FIGURE 8.10 

A view of one of the first full-energy collisions between gold ions at Brookhaven 
Lab's RHIC (Relativistic Heavy Ion Collider). The image was captured by 
the Solenoidal Tracker At RHIC (STAR) detector. Each track indicates a 
path taken by one of the thousands of subatomic particles produced in the 
collision. Photo courtesy of Brookhaven National Laboratory. 


for photons of wavelength A to A+ dA is 


dN 1 dX 


where a is the fine structure constant. The maximum number of photons 
per cm emitted is in the range of 200 to 300 photons. This means that the 
detectors typically need to be several meters long to ensure detection of a 
signal. 


8.2.8 Solid State Detectors 


The newest technology in detecting charged particles was developed in the 
1980s at SLAC [92]. The basic idea is to make use of diodes to detect the 
movement of charged particles. Wafers of very lightly doped silicon are very 
sensitive to the passage of charged particles — for example the formation of 
an electron-hole pair requires only about 3 eV of energy. A diode (ie. a 
p-n junction) will be able to detect charged particles that move through its 
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FIGURE 8.11 

The STAR detector at RHIC, a Time Projection Chamber, that tracks and 
identifies particles emerging from heavy ion collisions, just before its comple- 
tion. Photo courtesy of Brookhaven National Laboratory. 


wavefront 


particle trajectory 


FIGURE 8.12 
Cerenkov radiation. The angle cos 0 = 1/(n provides a measurement of the 
speed of the particle. 


depletion region (the interface between the p and n materials). As the charged 
particle moves through the interface it interacts electrically with the lattice, 
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FIGURE 8.13 

Schematic diagram of a Ring-Imaging Cerenkov (or RICH) Radiation De- 
tector. The electrons produced by these photons will form a ring pattern. 
Image courtesy of SLAC National Accelerator Laboratory. 


creating electron-hole pairs in a narrow column along its trajectory. The 
electrons and holes move in opposite directions under the influence of the 
electric field that has been established by the ionized impurity atoms in the 
wafer. This creates an electrical signal whose location corresponds to the 
trajectory of the particle. 


The depletion region is made as thick as possible (typically 300 microns) so 
that the path of the charged particle is as long as possible, thereby yielding 
a stronger signal. By fabricating the diodes in parallel strips with a spacing 
of 25 microns (see fig. 8.14), a resolution of about 5 to 10 microns in the tra- 
jectory of the particle can be achieved. By gluing two such microstrip wafers 
together at an angle, the passage of a charged particle gives two independent 
signals as it moves through the detector, thereby yielding two independent co- 
ordinates which allows for the specification of a unique set of points in space 
corresponding to its path. Solid-state microstrip detectors are now being used 
in all the major high energy facilities today. They are what permitted the 
detection of the top quark, and (it is hoped) will assist in the search for the 
Higgs boson. 
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FIGURE 8.14 
Section of a typical microstrip plane. 


8.2.9 Calorimeters 


These devices work on the principle that an incident particle generates sec- 
ondary ones, which generate tertiary ones, ... so that all of the incident energy 
is absorbed into the medium. A calorimeter absorbs all of the kinetic energy 
of a particle, yielding a signal that is proportional to this energy. Such detec- 
tors have energy resolution like ~ 1/ VE, allowing for high precision. They 
also permit quick decisions on event selection. 


[CTrackmg] * 
O Hadron Calorimete 


FIGURE 8.15 
Schematic diagram of a modern calorimeter. Image courtesy of the Particle 
Data Group [1]. 
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'There are several kinds of calorimeters, each designed for detection of par- 
ticular kinds of particles. They are often made of an absorbing material that 
is interspersed with sampling devices that determine the energy of the de- 
veloping shower of particles. However, they can also be made of a single 
homogeneous material (for example lead glass) — these have better resolution 
but are considerably more expensive to construct. 


8.3 Modern Collider Detectors 


A modern detector in a collider is layered, with each layer performing a dif- 
ferent function. Fig. 8.16 illustrates the general structure of a such a device. 
Let's look in a bit more detail at each layer. 


8.3.1 Tracking Chambers 


The inner region of any collider detector is called a tracking chamber. Its 
main function is to detect as accurately as possible the trajectories of the 
particles that leave the collision. It is generally a segmented combination of 
multi-wire proportional counters, streamer chambers, and solid-state detec- 
tors. Typically the innermost part of this chamber consists of a set of silicon 
microstrip detectors whose purpose is to provide precise spatial information 
about the trajectories of the charged particles that emerge from the collision 
point. They must be sufficiently thin so as to minimize errors due to multiple 
scattering. Outside of this are drift chambers, typically in an axial magnetic 
field, that provide measurement of the momentum of any emitted charged par- 
ticles. The outermost stage is a set of pre-shower counters, which are about 
3 radiation lengths of absorbing material followed by scintillation counters. 
They provide the first evidence of particles that are electromagnetically inter- 
acting because high energy photons traveling through the absorbing material 
will convert into electron/positron pairs that in turn produce a shower in front 
of the scintillators. 


8.3.2 Electromagnetic Shower Detectors 


High energy photons and electrons can produce cascade showers. The incident 
electron produces photons, which produce ete~ pairs, which produce more 
photons, which produce more e*e^ pairs .... The result is a cascade of pho- 
tons and electrons increasing exponentially with depth. Coulomb scattering 
causes the shower to spread laterally. Almost all of the energy of the shower 
appears as ionization loss of charged particles in the medium. These detectors 
are built from high-Z materials with small Xo (typically about 25 radiation 
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lengths, which is one mean free path for hadronic interactions) so the shower 
is contained in a small volume. They are placed after the tracking chamber, 
and absorb all of the energy of electrons and photons that pass through them. 


8.3.3 Hadron Shower Calorimeters 


'These work on the same principle as above, except that an incoming hadron 
produces secondary hadrons, which produce tertiary hadrons ... . These 
detectors are much larger than the EM ones because the length scale for 
development of the shower is much longer than the radiation length of the 
medium. Another complication is that ~ 30% of the incident energy is lost by 
either exciting or breaking up nuclei. This can be compensated for by using 
2380 as the medium; in 288U extra energy is released by fast neutron and 
proton fission of the nucleus, which compensates for the losses from breakup. 
'These detectors absorb the energy of all of the remaining emitted particles 
except for muons and neutrinos. 


Tracking Electromagnetic Hadron Muon 
chamber calorimeter calorimeter chamber 


Innermost Layer... —————— ——9À ...Dutermost Layer 


FIGURE 8.16 

Schematic diagram of how particles deposit energy in calorimeter detectors. 
Note that the photons and neutrons, being neutral particles, are not detected 
in certain layers. Image courtesy of the Particle Data Group [1]. 
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FIGURE 8.17 
A cross-section of a calorimeter, showing the various tracks of different particle 
decays. Image courtesy of the Particle Data Group [1]. 


8.3.4 Muon Chambers 


The only (known) particles that can make it beyond the hadronic calorime- 
ters are high-energy muons and neutrinos. The muons are detected in the 
outermost layer, which measures their momentum. Their trajectories can be 
traced back to the inner region by checking consistency with the data from 
the previous three layers. Only neutrinos escape undetected. Their pres- 
ence is inferred from conservation of 4- momentum - a lack of momentum and 
energy balance amongst the remaining particles suggests the presence of one 
or more particles that do not experience either the strong or electromagnetic 
interactions. 

How do we know the undetected particle is a neutrino? The best way to be 
sure is to check the overall consistency of the event with a prediction of the 
Standard model. If the event is not consistent with such predictions, then it 
may very well signal the existence of a new particle or a new type of inter- 
action. In order to be sure that all of the missing energy is accounted for, 
the detectors must cover as much of the 47 solid angle around the interaction 
point as possible and the layered detectors must minimize energy loss due to 
their structure. This is a great technical challenge. It is a remarkable achieve- 
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ment of modern physics that such detectors can be constructed, providing us 
with reliable empirical information about the subatomic world. 


8.4 


Questions 


. Antiprotons were first discovered in an experiment by Chamberlain et 


al. that produced them at a momentum of 1.2 GeV. In order for them to 
be detected by a Cerenkov counter, what would its minimum refractive 
index have to be? 


A 400 GeV muon passes through a block of iron 500 cm thick. 'To what 
accuracy can its incident angle be measured? 


Two particles of the same momentum but different mass travel between 
two scintillation counters separated by a distance L. 


(a) What is the difference in their time of arrival? 


(b) What is the minimum flight path necessary to distinguish Kaons 
from pions with momentum 4 GeV/c if the time of flight can be mea- 
sured to an accuracy of 200 picoseconds? 


How long would a gas Cerenkov counter of refractive index n have to 
be in order to distinguish Kaons from pions of identical momentum p? 
Assume that at least 100 photons in the visible range are needed to 
ensure a detection. 


The rate of energy loss of protons traveling a material of density p 
typically follows a power law 


where the constants K and p are characteristic of the material and Eo 
is a reference calibration energy that we can take to be 1 MeV. 


(a) Find an expression for the thickness of material that will reduce the 
energy of a proton by half the value it has upon entering the material. 
(b) For Carbon, K — 212 g-MeV/cm?, and p — 0.757; for Lead, K — 
89.5 g-MeV/cm?, and p — 0.694. How much thicker must a slab of 
Carbon be than a slab of Lead in order to reduce a proton of incident 
energy 150 MeV by half? 


Alpha particles of energy 300 MeV pass through a copper foil 100 mi- 
crons thick. How much energy do they lose? 
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7. In a photomultiplier tube, each incident photon produces n electrons. 


Over N repetitions of this process the average number of electrons pro- 
duced is 
D 
i=1 


and the probability of observing a particular number n of electrons for 
a typical measurement is 


zi 


P (n) = — e” 


a distribution called a Poisson distribution. 

(a) Show that >”, P (n) — 1. 

(b) Compute the average (n) = 7, nP (n) 

(c) Find the standard deviation ((n — (n))”) of the number of elec- 


trons observed. 


. An underground scintillator at SNOlab counts five muons per hour on 


average. Suppose this experiment is run for 1000 hours. How often 
will counts of 2, 4, 6, 8 and 10 muons be recorded? 
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We obtain empirical information about particle physics from a variety of 
sources: from studying cosmic rays, the cosmic microwave background, and 
bound states — the latter is particularly important when it comes to quark 
physics. However, most of what we know experimentally in particle physics 
comes from data on the decays of unstable particles and on the scattering of 
one particle from another. The accelerators and detectors that were discussed 
in chapters 7 and 8 are the primary tools used to measure these phenomena. 
We want to make use of the data that emerges from these experiments to 
modify, corroborate, and falsify our theories of the subatomic world, the ulti- 
mate goal being that of obtaining the deepest understanding possible of the 
laws of nature. 

In order to do this — that is, in order for theory and experiment to make 
contact — we need to deal with quantities that the theorist can calculate and 
that the experimentalist can measure. There are a number of such quantities, 
but the most important two are lifetimes and cross-sections. 


9.1  Lifetimes 


'The lifetime 7 of an elementary particle is the average amount of time it takes 
for the particle to decay into something else. If it doesn't decay into anything 
then we expect the lifetime to be infinite; we say that such a particle is stable. 
In practice we can experimentally only set lower bounds on the lifetime of 
stable particles. For example the lifetime of the proton [93] has been measured 
to be Tp > 2.1 x 102 years; for the electron [94] it is Te- > 6.4 x 10% years*. 
We will assume that particles that have only lower bounds on their lifetimes 
are indeed stable, never decaying into anything. There are only a few particles 
in nature of this type. The vast majority of elementary particles are unstable 
and will generally decay into some other particles. 


*These lifetimes are for the proton and electron to decay into anything — known particles 
or not. If we add the additional requirement that the decays be into known particles, then 
these lifetimes increase by a factor of 100-1000 [95]. 
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In principle, measurement of the lifetime of a particle is simple: just watch 
it to see how long it takes to decay! However, in practice this is complicated 
by relativistic effects (faster moving particles take longer to decay than slower 
moving ones) and quantum effects (particles moving at the same speed will 
decay at different rates due to the uncertainty principle — we can't arbitrarily 
reduce the measurement error of the time it takes to decay). 

Both of these problems may be circumvented by considering a group of 
identical particles in their rest frame. In this case relativistic effects average 
out and although we cannot predict the lifetime of a given particle in the 
group, we can predict (and measure!) the rate of decay of the group as a 
whole. So we are interested in how many particles per unit time, on average, 
will decay in a large group. This is the decay rate I': the probability per unit 
time that a particle will decay (as measured in its rest frame). 

So if we have, say, N(t) unstable particles at time t, the probability that 
any one of them will decay in time At is l'At. Hence the entire sample 
consisting of N(t) particles will decrease by an amount AN where 


AN = - NTAt => N(t) = Noe ** (9.1) 


and so the number of particles decreases exponentially with time. Here No 
is the number of particles in the sample at t = 0. It’s easy to show that the 
mean lifetime is just 7 = 1/T. 

Most particles decay by several routes: for example, the 7-lepton can decay 
into u` +7, +v, or into e. + Ye +1,, or into vw 4 vr, or to a number of 
other types of elementary particles of smaller mass. The total decay rate is 
the sum of the individual rates for each process 


Ptot = > Ti (9.2) 
i=1 
and the mean lifetime is 
1 
T==— (9.3) 
tot 


The branching ratio is the fraction of all decays into a given mode: 


tot 


B; (9.4) 


and in practice this is the quantity of interest. 

'The task of the theorist is then to compute as accurately as possible the 
decay rates I'; for each particle described by his or her proposed theory. The 
task of the experimentalist is to measure each I; as precisely as possible to 
test the theory. 
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9.2 Resonances 


Unstable particles are observed as resonances. These are local maxima in the 
cross-section as a function of energy, or as a maximum in the invariant mass 
distribution of the particles in the final state of a reaction. 

To understand better the character of a resonance, let's consider at atom 
modeled as a massive pointlike nucleus with a charge at the origin surrounded 
by a cloud of equal but opposite charge. If the nucleus is displaced from the 
origin relative to the cloud then the system will undergo oscillations. In any 
given spatial direction there will be an infinite set of damped modes, each 
with a characteristic frequency and width. Since there are three independent 
spatial directions there is a triply-infinite set of modes. Treating the system 
classically, suppose we let X be the coordinate measuring the distance of the 
system from equilibrium, where X = Xy. To a very good approximation we 
will have 


X (t) 2 Xoexp (-55) cos (wt) (9.5) 


for some characteristic frequency w and width T. 

Suppose now that the system is subjected to a periodic force with frequency 
Q and amplitude proportional to Fo in the direction of motion. This could be 
done by directly shining monochromatic light onto the atom. The behavior 
now becomes 


X (t) = Xo exp (-55) cos (wt) 4 kio 


Vo? = 02)? +0272 


and we see that for late times, the first term becomes negligible and the system 
undergoes periodic motion. The average (kinetic) energy of the atom is easily 
found to be 


cos(Qt+y) (9.6) 


1 2k? FEQ? 
(E) = 5 — = Rw) 
4 (w2 — 02)? + 9212 
where m is the mass of the atom, and I have written this average in terms of 
the response function 


mex? FR 
4T? 


(9.7) 


QT? 
(w? — Q2)? + 02T2 
that tells us the general dependence of the average energy of the atom inde- 
pendently of its mass, and the amplitude of the driving force. The response 
function is a maximum when w = Q, and we can approximate its behavior 
near the maximum via 


R(w) = (9.8) 


R(w) c ewes Ex (9.9) 
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an expression known as the Breit- Wigner shape function [96]. 

Applying these ideas to particle physics, we can regard the ground state 
of the atom as one kind of *particle" and an excited level as another kind of 
"particle". T'his may sound a bit strange at first, but each of these states of 
the atom has their own characteristic energy, frequency and lifetime, where 
the latter is infinite for the ground state and finite for all of the excited states. 
'The search for unstable particles in particle physics is analogous to the search 
for unstable atomic states of an atom. To find such unstable atomic states 
we could shine a monochromatic light beam whose frequency we could vary 
and then search for resonant behavior of the atom (initially in its ground 
state) as the frequency is varied. Peaks in the intensity of the scattered light 
will correspond to the excited atomic states (the unstable “particles”). The 
resonance will occur at a definite energy (the value w, which is the “mass” of 
the “particle”) and will have a definite width proportional to r. The shape of 
the peak will be given by the Breit- Wigner shape function. Similarly, to find 
unstable subatomic particles, we collide two beams together and search for 
increases in the number of particles reaching the detector — that is, for peaks 
in the cross-section (described in detail below) — as a function of energy. 

We can also find resonances by producing them as intermediate states. For 
example, consider the reaction 


A+FB=>R+E=>C+D+E (9.10) 


which means that particles A and B collide to temporarily form a particle E 
and a metastable particle R (the resonance), that in turn decays into particles 
C and D. We expect in such cases that the mass Mop of the final state of 
C and D will be described by a Breit-Wigner function peaked at the mass of 
the resonance with width Ir. If there is no resonance then the mass Mop can 
have any value consistent with energy, momentum, and angular momentum 
conservation and will be a smooth function of collision energy that is not 
peaked. 


9.3 Cross Sections 


If we have two particles at our disposal then we can have them collide to 
see what happens. Again this is a conceptually simple idea — just throw the 
two particles against each other (perhaps by placing one as a fixed target, 
or perhaps by throwing both) and measure what comes out. However, this 
simple idea is fraught with complications: due to the uncertainty principle it's 
impossible to aim the particles with arbitrarily precise accuracy. As with the 
previous case, we deal with this inherent randomness by considering groups 
of particles scattering off of one another. 
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Let's consider what happens in a frame where one group of particles (the 
target) is at rest. For example we might consider a beam of neutrons impacting 
upon hydrogen gas (essentially protons). We then have a situation of the 
type described in fig. 9.1. The beam consists of a flux of particles with 
luminosity £: the number of particles per unit time incident on a given area 
that is perpendicular to the axis of the beam. As a particle in the beam 
approaches the target it will experience some kind of interaction with the 
target (or rather, with the potential generated by the target) and will scatter 
off at some angle 0 relative to its initial trajectory. This angle depends upon 
the impact parameter b, the distance by which the incoming particle would 
have missed the scattering center had it not been scattered. The particles with 
impact parameters between b and b 4- db will scatter at angles between 0 and 
0 — d0, where the minus sign expresses the fact that as the impact parameter 
gets larger the scattering angle gets smaller. In general the dependence of 0 on 
b will depend upon the particular type of interaction between the particle and 
the target. In other words, it will depend upon our underlying fundamental 
physical theory. 


FIGURE 9.1 
Particle incident in an area do (the shaded region on the plane) scatters into 
a solid angle dQ (the shaded region on the sphere). 


The beam of particles between b and b+db will form an annular ring centered 
about the beam axis. The part of the beam between angles ¢ and dọ (where ¢ 
is the angle around the annular ring relative to some axis perpendicular to the 
beam axis) will have dN = Ldo particles passing through the infinitesimal 
area do of the annular ring per unit time. This part of the beam will be 
scattered into some solid angle dQ between Q and Q + dQ. 


The differential cross-section D(0,¢) is defined as the ratio between these 
two quantities: 


(9.11) 


166 An Introduction to Particle Physics and the Standard Model 


Hence an experimentalist can measure the differential cross section! by count- 
ing the number of particles scattered into a given solid angle per unit time 
and dividing by the luminosity, which is controlled by the apparatus. Most of 
the time D(0, à) is independent of ¢ since we deal with spherically symmetric 
potentials. 


FIGURE 9.2 
Hard sphere scattering 


As an example, suppose we scatter a particle off of a hard sphere. Our 
"theory" is that the scattering center consists of a potential that is zero ev- 
erywhere except at the edge of the sphere of fixed radius R, where it becomes 
infinite. Hence any particle encountering the sphere will elastically bounce off 
of it, with its angle of incidence o equalling its angle of reflection. We can see 
from fig. 9.2 that the scattering angle 0 = m — 2a. If the impact parameter 
is b and the radius of the sphere is R we have b = Rsina = R cos 2, yielding 
(indirectly) the dependence of 0 on b. Geometrically do = |b db dó| and the 
scattering solid angle dQ = |sin 0 dé do|. Consequently 


do __|bdbdd| | |Rcos$d(Rcos$)|  R?cos$sin$ m? 


dQ  |sin6d6dó| - [sin 0 dé | |. 2sinÜ 4 


(9.12) 
and integrating over solid angle we have o = dQ = s R2. So the effective 
8 4 


tNote that in mathematical terms the quantity D(6, $) is not really a differential at all. In 
practice people refer to 42 as the differential cross-section, and sometimes even abbreviate 
this to just do, with the implicit understanding that an integral over the solid angle must 


be carried out to find c. 
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area the sphere presents to the beam is the projected area of its forward 
hemisphere: any particles within TR?will scatter; all others will not! 

'This example illustrates the character of a cross-section: it is the effective 
area that the target presents to the beam or, more generally, it is the effective 
area that one particle presents to another. However, it is important to be 
aware of the limitations of this example — in general the target is not one that 
you either hit or miss (as was the case here) but rather is one for which the 
closer you come the greater the deflection. The dependence of the differential 
cross-section as a function of angle (in turn determined by underlying theory) 
in general terms encodes this “soft” behavior. 

For cross-sections, we often are interested only in the case where the final 
state consists of a particular type of particle (or set of particles). As with 
the decay rate we have many possible outcomes for a given set of incident 
particles, each of which will have their own cross-section c;. The total (or 
inclusive) cross-section is obtained by summing over all possible channels (or 
modes) of scattering: 


tot = 5 0; for n possible modes of scattering (9.13) 
i=1 


In general c is inversely proportional to velocity, since the cross section should 
be proportional to the amount of time the incident particle spends near the 
target. Hence if we graph c vs. velocity (or, more commonly o vs. Energy), 
we expect to see it montonically decrease. However, this behavior is not 
universal: a bump in this graph means that the particle “likes” to be near 
the target at that energy — the target presents a much larger area to the 
incident beam. This is another example of a resonance, which in this case is 
a short-lived semibound state of particle and target. Hunting for resonances 
in the plot of o vs E is the chief way in which new (unstable) particles are 
discovered. 

'The absorption coefficient described in Chapter 8 can be related to the 
scattering cross section by recognizing that any particle scattered out of a 
beam produces an increase in the counting rate for scattering or equivalently 
a drop in the beam intensity. Suppose that there are n scattering centers per 
unit volume of material, each scattering center having a cross-section o. Then 
number of particles scattered through a material of thickness dx is nodx, and 
this must correspond to the fraction of the beam dZ/Z that is attenuated. 
Hence 


di 
nodxz = E = pdx (9.14) 


using the definition of absorption coefficient from eq. (8.20), which implies 


p=n0 (9.15) 
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9.4 Matrix Elements 


The essential job of the theorist is to take his or her favorite theory and use 
it to compute [ and do for various processes of interest. These quantities are 
computed from something called the scattering amplitude [97]. 

The scattering amplitude (or S-matrix element) is defined to be 

Sif =T>00 (P|S| V), , = S-matrix element (9.16) 

ie. it is the amplitude for some state |W) in the distant past (where we 
idealize all particles emitted by the source to be initially non-interacting) to 
evolve into some other state (®| in the future (where the detector is again 
idealized to absorb only free non-interacting particles). This action is just a 
time translation, so S is unitary. 

Most of the time in a scattering reaction, particles just zip by each other 
and nothing happens. To account for this we write 


Sip = if + Map = 1+ i (p1 Pm M| pi Pn) (9.17) 


where the “1” is the identity matrix in the Hilbert space spanning initial and 
final states. It is the matrix element M — the part of the S-matrix that is not 
the identity — that a theorist computes from a given theory. 

Quantum-mechanically, the probability for scattering is given by the square 
of the magnitude of the amplitude. Hence we expect for decay rates that 


dT x (o) += phn IM p) (9.18) 


since there is just one initial particle, and for cross sections 


do x |(p ++ Pin IM prp2)|” (9.19) 


since there are two particles in the initial state. 


You might think that we could begin calculating once we had M, but this 
isn’t quite right for several reasons: 


(a) we need to conserve momenta between initial and final states 


(b) because of the uncertainty principle we can’t prepare our initial states 
with exactly the momenta p; 


(c) again, because of the uncertainty principle, we can't precisely measure 
the final momenta p; of the outgoing particles 
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The first problem (a) is easy to correct. We just multiply by a ó-function 


that ensures the sum of the incoming 4-momenta equals the sum of the out- 
going 4-momenta 


(2^ 54) (>: P; — P2 — 3 


i=l 
= (2a) s (Es — Pa -ñ,) ô (>: Ege 3 (9.20) 
i=1 ¿=1 


where the factor of (amy is a convention. To address (b) and (c) we should 
multiply by the uncertainty (Ap)? in momentum for each p; and for p» and 
pi. However, this quantity isn't Lorentz-invariant. Instead, we multiply by 
(Ap)? 


“22. This is because in a frame moving with velocity 9 = v/c along some 


axis we have 


(ap? (Am) APL) s (A (py - BE) (45) 


E! E y (E - Bpy) 
E (G -525 )) (Api) (Api) _ (1- 88D) (Ap)? 
- E(1— p24) .  E(1-8%) 
_ (Ap)* (9.21) 
E 


where p, is the component of momentum orthogonal to the direction of the 

moving frame and pj is the component parallel to the moving frame?. So it 

is the quantity (Ap? that is Lorentz invariant, and we multiply by this. 

Putting it all together, for cross-sections we get 

m 3 
(Api) , / 2 

do x W = Wo | | ————3 liic Pm M| pipa 

laen 1771 Pm M| pipa)! 


i=1 


x (2450 (E p; — p2 — J 
i—1 


(Ap? (A, AN (9.22) 
2E! (2xh)? ) V 2E 2)? 2E!, (21h)? 


(Api) i (Apa)? 
2E; (27h)? 2E (27h)* 


x (250 (E p; — p2 — 3 
i—1 


(Api)? (Ap) 
2E, (27h)? 2E (2xh)? 


= Eo 


2 
x |(pi Pm M| pip2)] 


*The prime in eq. (9.21) refers to the moving frame (and not the final state). 
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where (2xh)? is the quantum phase space volume and W is a normalization 
factor. In order to turn the proportionality into an equality, note that 


to final state 


= sis incident ES incident 
B ` X (target) density / * \ (beam) flux 


, 3 relative 
= SW+ Ni + | 5 Gea] 
S 


= 2 —W 
MiM Ip - Vi| 


dee = Geer of ue SH 


(9.23) 


where Ni and A3 are the incident particle densities and S is a statistical 
factor that corrects for overcounting of identical particles (we'll look at it in a 
bit more detail below). Now recall [98] that the density of states for a particle 
with quantum numbers nz, nyand nz is 


3 3 
(Ap)? = (=) (An)? = o) (22H) oc (2x8) N (9.24) 


where the particle is in a volume V — L?. Applying this to the incident 
particles, we have 


SWo | (Ap)? / / 2, 
do = = = i Pail M 
MiM |v — vl II 2b! (2x)? pi p | | p1pa)| 
AUN 4 s(4) = / 
x AE, E; (2x) 6 2,1 — p2 — pı (9.25) 
or 
nS m c (Ap? 


do = 


1 1 2 
T Ip +++ Pin M| pipo)] 
Av (pi pa)? — p2p? Lic 2E; (27) 


x (20) 5 (>: P; — p2 — 3 (9.26) 
i=l 


choosing the normalization Wọ to obtain equality and the correct units. This 
is the cross section for a process in which particle 1’ has a 3-momentum that 
lies in the range Ap! around the value p}, particle 2’ has a 3-momentum that 
lies in the range Ap} around the value ph, etc. Typically the momenta of all 
but one of the particles (say particle 1’) in the final state are integrated over; 
what remains after integration is the differential cross-section for particle 1’ 
to emerge into the solid angle dQ. 
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If we only had one particle in the initial state, we'd be interested in the 
decay rate, and a similar analysis would give 


Pr CAL erg ass a 
Me | E Pa MB Gn)" 5r- p) (9.27) 


where the decaying particle initially has momentum p. 


SC 


pe 2hE 


9.4.1 General Features of Decay Rates and Cross-Sections 


Some comments on these formulae are appropriate. 


1. The factors in front of the cross-section and decay rate come from 
Lorentz covariance. It is not hard to show that dI has inverse time- 
dilation properties (i.e. it transforms as an inverse time). The factor in 
front of the expression for do can be written as 


(pı p2)* — pipi = E1 E2 [vo — 0, | (9.28) 
for particles in a head-on collision, where py = —pi. 


2. The factor S is a statistical factor — it equals 1/n! for each group of n 
identical particles in the final state. For example, if a particle decays 
into four identical particles of one kind, and three of another kind, then 
Scar 

3. The factor | JJ; ari)" (27) 5) Oi 

: i=1 ZEO 1 P; — p) is called the phase 
space of the final state, where E; = \/|p,"|2 + m2. Physically it is the 
kinematic information in the scattering process: it tells us how many 
different ways the available energy and momentum can be partitioned 
into the final state. If a heavy particle decays into very light ones, then 
lots of phase space is available because only a little bit of the initial 
energy will go into the rest masses of the final particles. But if a particle 
decays into one close to its own rest mass (such as a neutron decaying 
into a proton, an electron, and an antineutrino), then very little phase 
space is available, reducing the decay rate. The phase space factor is 
Lorentz invariant. If we want to sum over all possible final states, we 
must integrate over this quantity: 


a ccm o j^ dd 


V pi po)? — rip pa)” — p?p? 


x (22)* 60 be P; — p2 — ») (9.29) 


i=1 
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ai Hagens a E MP Ems (3-2) (9.30) 


which gives the total cross section (or decay rate) for the process under 
consideration. Here I have written |M|? = |(f |M| i^ as the square of 
the matrix element between initial and final states. 


= E 


4. The quantity |M| is a scalar function of the momenta. If the particles 
emitted have spin, then |M] is still a scalar function, but it can now 
depend on things like p'- 51, p,- 59, and $1 - $2. However, we will always 
work with spin-averaged quantities, in which case these terms average 
out to zero. 


One other comment: The factors of (27) can seem like a nuisance, but they 
are essential to get the numerical values of the answers correct. And they are 
not too hard to remember: just put a (27) in the numerator for every ó (so 
9%) gets a (27)* and put a (27) in the denominator for every d (so each d?p; 
is divided by (27)?). Once you've done this, you can of course cancel them 
out accordingly. 

Enrico Fermi called the formulae (9.26) and (9.27) for computing cross- 
sections and decay rates the Golden Rule [99], though most of the work leading 
up to it was done by Dirac [100]. Basically the rule says that to find the 
transition rate for any process, take the modulus of the amplitude, square it, 
and multiply by the phase space. The Golden Rule not only occurs in particle 
physics, but also in non-relativistic time-dependent perturbation theory in 
quantum mechanics [101], where you may have already encountered it. 


9.5 2-Body Formulae 


The general formulae for decays and cross-sections are rather formidable. For- 
tunately they really simplify if — as is very common - there are only two par- 
ticles in the final state. This will prove to be very useful in understanding 
much of particle physics, so we will list these formulae here. 


9.5.1 2-Body Decay Rate 


In the rest frame of a decaying particle the 2-Body decay rate becomes 


$ plc 
T 2 Boay E e ¡MP (9.31) 
8a (Mc) 
where p is the momentum of either of the outgoing momenta in the final state 
(it's the same for each final particle since momentum is conserved), and M is 
the rest mass of the decaying particle. 
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9.5.2 2-Body CM Cross-Section 
'The cross section can have two useful forms if the final state has only two 


particles. In the center-of-momentum frame (CM frame), when both particles 
collide head-on 


2 2 (5 "EN 
(35) = (=) S|M| a | CM frame: i =P 
dQ 2-Body CM 87 (E, + Ez) ipl i 


FIGURE 9.3 
Two body scattering in the center-of-momentum frame. 


9.5.3 2-Body Lab Cross-Section 


In the lab frame pə = 0. The differential cross section becomes somewhat 
messy in this frame unless we have elastic scattering (A + B — A + B), in 
which case 


(35) 7 ( r ) SIMI I 
dQ 2-Body elastic-LAB 8m M3 [pi | [2x] (Ey + Mac?) ES Ip: Ei cos 0| 


T p, = 0 
Lab frame: D = Mac? (9.33) 


Note that these formulae are valid no matter what |M| is. This is why they 
are so useful — we don’t have to do the delta-function integration every time. 
I've left the derivation of these formulae for you to do in the problems at the 
end of this chapter. 
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1 A 
P 
P; res: 
P» 
FIGURE 9.4 
Elastic scattering in the Lab frame. 
9.6 Detailed Balance Revisited 
The factor 
IM] = KFM i| = pi Pm M| pip2)] = |M gil (9.34) 


is what we compute from a given physical theory such as quantum electro- 
dynamics (QED), or quantum chromodynamics (QCD). For a given physical 
process, different competing theories will give different values for this factor: 
this is where theory and experiment “meet.” For particles with spin we have 


IM] = JG ED] = pisi iP Sm IM pis 51; P2, 82) = [Mg] (9.35) 


The spins can really complicate the formulae for I or ø. Fortunately we 
can avoid this complication in most cases because experiments use beams of 
unpolarized particles for the initial states and detectors are insensitive to the 
final state spins?. Hence we define 


VID average over initial spins 2 
|M| = — JIM] (9.36) 
and sum over final spins 
1 |o / / 2 
= TE M ; 
(25, + 1) (255 + 1) 5 , |(p1; 51; Pm: Sm | | p181; P282)| 
$1,92,S8 
whenever the incoming or outgoing particles have spin. 
Note under time-reversal and parity 
PT (pi, 51; i Pm Sm M| p1,51 5 P2, 82) 
—P(-p,—51—92, —82 |M] — Dy, 91:5 Pim Sm) 
= (pi, —511 pa — 82 |M] pi, 81; i Po —Sm) (9.37) 


SThis is true most of the time and it is the only situation we will consider. However, keep 
in mind that for polarized beam experiments, such as those carried out sometimes at the 
Stanford Linear Collider this is NOT true, and a more detailed analysis must be carried 
out. 
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and if these are both symmetries of a given interaction then 


Y pios ipse IM p1513 p282)|" 


$152 s' 
2 
= Y Musis — 82 IMI py, 55r —Sm)] (9.38) 
$192 s! 
since summing over a given spin from —s to +s is the same as summing from 
+s to —s. Inserting factors of (25; +1) to take care of spin-averaging gives 


[[ 2s: +1) Mal = [25+ 1) Mal (9.39) 


i=l i=l 


3 


which is the principle of detailed balance! 


9.6.1 Pion Spin 


Let's apply this to a general 2-Body scattering situation. The 2-Body cross- 
section is 


lic? a! = 
«(4B ^ CD) = (5) Sep [ ext (9.40) 
0 


in the CM frame, with Ex + Ep = E. For the reverse reaction carried out 
at the same total energy we find 


licN? Sap |p — 
0 


and so detailed balance implies the relation 


c(AB— CD) (250 1) 2sp +1) [P Sop 


= 9.42 
c(CD— AB) (2s4+1)(2se+1) |p? Sap pu 
which can be tested in experiments. 
For example, consider the reaction 
p+p—at+D (9.43) 


which is the collision of two protons forming a deuteron D and a pion. We 
have from the above 
o(ptp—at+D) _ (287 +1)(254+1) [Pr]? 1 


c(r*-D— pcp) (2s) +1) (25, +1) |n? (1/2) (9.44) 


where we have a symmetry factor of > because the protons are identical. 


Independent measurements indicate that the proton spin is i and the deuteron 
2 22 
| = [pp] 

o(p+p—>"?+D) _ (28, +1) 


= 9.45 
o(r++D—p+p) 2 (pto) 


spin is 0, so for |p, 
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Measurements by Cartwright, Clark and Durbin [102] showed that 


o(nrt +D — pt+p) 
Cp rp aa +D) 


= 2.0 + 0.4 = s, =0 (9.46) 


from which we empirically determine that the pion is a spin-0 particle! 

As a final comment, for the weak interactions, parity and time-reversal are 
not symmetries, and so detailed balance does not hold in general. However, 
first order perturbation theory in the weak interactions indicates that Mif = 
M fi, and so detailed balance will hold to lowest order in the weak coupling. 


9.7 Questions 


1. Show that the 2-Body decay rate is 


S |p|c 
E |p| ; IMP 
8rh (Mc) 


2. Show that the formula for the 2-Body cross-section in the CM frame is 


ES (ey S MP [i| 
dQ.) CM 81 (Ei + E2) A 


3. Show that the resonance function 


or? 
(w2 — Q2)? +0272 


R(w) = 


can be approximated by the Breit-Wigner function near w = Q. 


4. Consider a non-relativistic particle of mass M scattering off of a fixed 


repulsive potential V = A. where k is constant. 


(a) Find the scattering angle 0 as a function of the impact parameter b. 
b) Find the differential cross-section for this process. 


( 
(c) Compute the total cross-section. 
( 


a) Show that 


(p1: p2)? — p?p? = (Er + Es) |51| = Ey E2|U — Vy 
in the CM frame. 
(b) Compute this quantity in the lab frame. 
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6. For elastic scattering A+ B — A+ B in the lab frame, show that the 
differential cross-section is 


ES EC 8 MP I 

djian  X8m/ Me |pi| [py | (E1 + Mac?) — |pi| E, cos 6] 

and simplify it in the limit that the recoil energy of the target B is 
negligible, i.e. mpc? >> Ea. 


7. Find the expression for the differential cross-section in the lab frame 
when the final state particles are massless 


8. For elastic scattering with the incident particle massless, find the ex- 
pression for the differential cross-section in the lab frame. 


9. Find the general expression for the decay rate of a massive particle into 
two massless ones. 


10 


A Toy Theory 


DOI: 10.1201/9781420083002-10 


We have now established all of the necessary background for confronting the- 
ory with experiment. The key task from a theoretical viewpoint is to compute 
the matrix element M for a given process. However, before trying to evaluate 
“real-world” processes such as those that occur in Quantum Electrodynam- 
ics (QED) or Quantum Chromodynamics (QCD), it’s useful to look at what 
physicists call a “toy”-theory: a theory that is not necessarily intended to 
model the real world but whose purpose is to illustrate the essential features 
of the method. 
We are interested in computing the matrix element, i.e., the quantity 


IM] = KG IMI) | = pisi Pm Sm IMI pio 815 p2, s2)| = Mil (10.1) 


for a given theory. The key method for doing this is to use a set of diagrams 
called Feynman diagrams*. Each diagram corresponds to a set of mathemat- 
ical rules (called Feynman rules) that are in turn used to compute the matrix 
elements of interest. This is the method that we want to illustrate with our 
toy theory. 

In order to properly understand where the rules come from we need to make 
use of more advanced methods from quantum field theory — an extension of 
quantum mechanics that allows for the creation and annihilation of particles 
in physical processes. This requires a rather formidable mathematical back- 
ground that is beyond the scope of this book. Fortunately we don’t need to 
have all of this background in order to actually employ the rules to compute 
matrix elements. So the approach that I will take here is to show you what 
the rules are and how to use them. If you want to know for a given theory 
how to derive the rules (in other words find out why the rules are what they 
are) you will have to study quantum field theory [106]. 

We can see from the above that in general a matrix element depends on 
both the momenta and spins of the initial and final states. Spin is an essential 
feature of the real world (all matter has spin-1/2, and photons have spin-1) 
but unfortunately its inclusion leads to a fair amount of messy algebra when 
calculating cross sections, decay rates, etc. Fortunately spin has nothing to do 
with how to calculate a Feynman diagram as such. Hence in our first attempt 


*These were developed by Richard Feynmann in 1949 [103] and were concurrently derived 
from quantum field theory by Freeman Dyson [104, 105]. 
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to learn the rules we would like to get rid of the complications introduced by 
spin. 

This means that our toy theory will contain only spin-0 (or scalar) particles. 
In order to calculate M we will need to know what the Feynman rules are 
for such particles. These rules are of two types — rules (and conventions) that 
are valid for any physical theory, and then rules that are specific to our toy 
theory. 


10.1 Feynman Rules 


Let's first look at the rules that hold for any physical theory of spinless par- 
ticles. For simplicity I will set the speed of light c and Planck’s constant h 
equal to one, restoring them in the final expressions for the cross-sections and 
decay rates. 


1. NOTATION. Label the incoming (outgoing) four-momenta by pi, ps, .. ., Pn 
(p, po. ...,p,,) and label the internal four-momenta q1,q2,...,qj, as shown 
in figure 10.1 


FIGURE 10.1 
'The general form of a Feynman diagram in a theory of spinless particles. 
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Although fig. 10.1 is a diagram in momentum space!, you can think of 
time as flowing from bottom to top and of space as being in the horizontal 
direction. In diagram 10.1 we have m different (spin-0) particles coming in 
from distant regions, idealized as free plane-wave states in the distant past 
from where they are produced by some source. They then collide together, 
interacting in a region that is symbolized by the “blob” in the middle, which 
refers to all possible processes that can cause the initial state to become the 
final state. We will see that these interactions can be understood in terms of 
particle exchange — in fig. 10.1 we see that there j particles exchanged during 
the interaction. After all interactions have taken place there are m particles 
produced that fly off to distant regions, again idealized as plane-wave states, 
where they may be detected. Note that n,m and j are not in general equal. 

These processes are dictated by the laws of physics; the rules encode these 
laws into the diagram. The labeling (p’s for external particles, q's for internal 
ones) is a convention — you can of course choose any labels you want for the 
momenta — but in first learning how to use Feynman diagrams it is helpful to 
stick to a common set of conventions. All physical theories have diagrams of 
this basic form. 


2. INTERNAL LINES. As noted above, inside the blob there will be various 
particles interacting with one another. The trajectory of each particle is 
represented by a line. In our toy theory with only scalar particles, each 
internal line contributes a factor as follows: 


q q -m 


FIGURE 10.2 
An internal line in ABB theory. 


where m is the mass of the particle of momentum q. The factor S is 
called the propagator. 

You might think that the propagator should be infinite — after all, isn't 
the square of the 4-momentum of a particle equal to the square of its mass, 
as in eq. (2.34)? However, for internal particles, which are called virtual 
particles, q? 4 m?. Only for free particles in plane-wave states does q? = m?. 


Particle physicists say that a virtual particle is not on its mass shell, which 


'This is because it corresponds to the Fourier transformation of the amplitude in physical 
space, another result from quantum field theory. 
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is another way of saying that q? 4 m?. Virtual particles do not obey 
their free equations of motion (considered in Chapter 11). Rather they are 
regarded as being in intermediate states (hence the term “virtual”) — they 
are not themselves directly detectable but instead mediate physical processes 
that change initial states into final states. 

Note that since there are two particles in our toy theory, we will have two 
propagators: one for particle A ( vm) and one for particle B (aor). 

Rule 2 depends on the spin of the particle, as we will see later on when we 
look at realistic physical theories. All spin-0 particles contribute this factor 


to any Feynman diagram. 


3. CONSERVATION OF ENERGY AND MOMENTUM In order for interac- 
tions to take place we will need to define something called a vertex, which is a 
point at which interactions take place. We shall do this later, since the rules 
for a vertex are specific to a given theory. However, regardless of the theory 
there is a general rule stating that for each vertex, insert a delta function 
factor of the form 


mt (Ky +k + kg E -- ew) 


where the k’s are the four-momenta coming into the vertex (i.e. each k^ will 
be either a q“, a p", or a p^). If the momentum leads outward, then k^ is 
minus the four-momentum of that line. This rule imposes conservation of 
energy and momentum at each vertex (and hence for the diagram as a whole) 
because the delta function vanishes unless the sum of the incoming momenta 
at the vertex equals the sum of the outgoing momenta. All physical theories 
respect this rule. 


4. INTEGRATE OVER INTERNAL MOMENTA For each internal momentum 

q, insert a factor 

d*q 

(27)? 
and integrate. The idea here is that the virtual particles will — for any given 
process — have any momenta possible that is consistent with conservation of 
energy and momentum. On probabilistic grounds, all such possibilities will 
occur and so we have to add them all up - this is what this integration does. 
The rule holds for all physical theories. 


5. CANCEL THE DELTA FUNCTION 
After performing the integrations the result will include a factor 


(27)t8® (p, +p + pi, — Pi — £2 —::: pn) 


corresponding to overall energy-momentum conservation. Cancel this factor 
(because we have already included it in our expressions for decay rates and 
cross-sections), and what remains is —iM. The rule holds for all physical 
theories. 
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After a bit of practice you can compress rules 3,4, and 5 into a single rule: 
integrate over all undetermined internal momenta. This way is faster, but 
in first learning the method it is easier to break this up into these 3 distinct 
rules. The factors of (27) are necessary — their origin is rooted in Fourier 
transformations of wavefunctions of the various states!, and they must be 
included if the correct numerical answer is to be obtained. Most of them 
cancel out against each other, but you must keep careful track of them. 

Except for rule 2, all of the above rules hold for any physical theory, and 
rule 2 holds for any spin-0 particle. However, we don’t yet have a theory 
because we don't know how the particles can interact. This is where the next 
rule comes in. We will suppose that we have 2 spinless particles — A and 
B (with masses m4 and mg respectively) — and then propose a rule for how 
they can interact. This kind of rule — dependent on which particles exist and 
how they interact — is what distinguishes one theory from another. The rule 
is the “vertex factor" rule. For our A, B theory we will propose the simplest 
rule possible [107]. 


6. VERTEX FACTORS For every interaction between particles A and B, 
draw a point with three lines coming out signifying one A particle and two B 
particles as shown in fig. 10.3, and include a factor of —ig for every vertex that 


FIGURE 10.3 
The vertex of ABB theory. 


appears in a given diagram. Due to the choice of interaction, I will call this 
theory? AB B-theory. The quantity g is the coupling constant of AB B-theory: 


tAgain, a proper derviation of this follows from the advanced methods of quantum field 
theory. 

3Many particle physicists would prefer to call this a model rather than a theory, reserving 
the term theory for the more general formalism that describes interacting quantum scalar 
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it measures the strength of our proposed interaction. 

Of course I could have proposed other interactions with still other particles. 
For example either of the two vertices in fig. 10.4 would be valid choices for an 
interaction between three scalar particles (A, B, and C). The theory described 
by the vertex on the right allows for only one interaction where each kind of 
particle meets together at a single point in spacetime. The vertex on the left 
corresponds to a theory having what is called a 4-point interaction: in this 
theory, the only interaction that can take place is one in which four particles 
meet at a single point in spacetime — two must be distinct and the B particle 
must appear twice. I could also draw other vertices, and I could use more 


FIGURE 10.4 
Other possible vertices for theories of spinless particles. 


than one vertex in my theory. 

So which vertices should I use? This depends on my mood, which is why 
we call this a “toy” theory: the interaction is something I invent for fun. Less 
fancifully, theorists develop toy theories to illustrate ideas and/or methods 
that they believe have something to do with reality. In the real world, the 
vertices model interactions that we hypothesize to be true in nature. Guided 
by experiment, in a real physical theory, we want these interactions to accu- 
rately describe the world we observe. For now we'll stick to ABB theory, that 
has only the single 3-point vertex in figure 10.3. 

An important difference to note here between the toy theory and a real- 
world theory is that the coupling constant g has units of momentum. In 
real-world theories (like QED), the coupling constant is dimensionless. The 
dimensionality of g will have physical consequences, as we will see when we 
consider particle decay in ABB theory. 

'There is one more rule that we need in order to be able to start calculating. 
This rule is the one that tells us how to construct the diagrams from the rules 


fields. I shall stick to the term theory, but you should keep this subtle distinction in mind. 
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and put them all together. It’s called the topology rule, and holds for all 
physical theories. 


7. TOPOLOGY To get all contributions for a given process, draw diagrams 
by joining up all internal vertex points either to the external lines or to each 
other by internal lines in all possible arrangments that are topologically in- 
equivalent, consistent with rule 6. The number of ways a given diagram can 
be drawn is the topological weight of the diagram. The sum of all diagrams 
is equal to —i M. 


Given the above set of rules, there are infinitely many possible diagrams 
that one can draw for any physical process. So how could we possibly add 
them all up? The answer is hidden in rule 6: each diagram has a factor of a 
coupling constant. If the coupling constant is small, then diagrams with more 
factors of the coupling (more vertices or more interactions) are less important 
(are numerically smaller) than diagrams with fewer couplings. So in practice 
we group diagrams by the number of powers of the coupling, and truncate the 
sum at whatever given order of the coupling we desire. In other words, the 
diagrams are to be understood in a perturbative sense: the matrix element 
will have the general form 


iM = — iM 494+ — iMog?+ — iM3g°+--- (10.2) 


where Mı is computed from a finite number of diagrams of order g, Ma 
from a finite number of order g?, etc. In understanding the basic physics in a 
scattering process, it is typically sufficient to retain only the terms of leading 
order. 

Now we're ready to calculate some processes in ABB theory. 


10.2 A-Decay 


The simplest diagram we can draw is given in figure 10.5. It has no internal 
lines and only one vertex. Applying rule 3 we get 


—iM = —ig(2n)^8(9 (py. + pp, — pA) (10.3) 
and then discarding the overall ó—function (rule 5), we get 
—iM =-ig> M =g (10.4) 
The two-body decay rate is, in the rest-frame of the A: 


S plc pig? 
= mee IMP = AT (10.5) 
srh (Mc) 16rhm%c 
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B B 


FIGURE 10.5 
Decay of the A particle. 


where p is the magnitude of the outgoing momentum of either of the two B- 
particles and S = 2 since the final particles are indistinguishable. Momentum 
conservation due to the 6-function implies 


2 2 
ma = IB + m3, + y/ 10? + m3, 


2 
NL -m (10.6) 
and so we get 
g’ mh 
r= 10. 
32nhcmA m Bom 


for the decay rate of the A particle. 

For any given theory the masses mA and mp are fixed, reflecting the fact 
that in the real world an elementary particle cannot change its rest mass. 
However, we can mathematically adjust the mass to be whatever we like — in 
so doing we are really comparing a whole continuum of theories dependent on 
the choice of the particle masses. We can also do the same for the couplings. 
Such comparsions afford important physical insight. 

For the decay process above, we see that the smaller the value of g, the 
smaller the decay rate, in accord with our intuition that a weaker coupling 
makes for a less probable interaction or a less likely decay. If mg > *5^ (i.e. 
if the B-particle has a rest-mass more than half that of the A), then the decay 
rate becomes imaginary — in other words there is no decay! This corresponds 
to a stable A particle as we expect. But note that for large ma, the decay 
rate T ~ ma: : it is small, yielding a long lifetime for a heavy A particle. 
This leads to the counter-intuitive result that a heavier A particle (i.e. heavy 
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relative to the B) would decay less rapidly than a lighter one! We actually 
find a maximum for the decay rate when m4 = 2V2m p, and so a toy theory 
with this mass value would have the shortest-lived unstable A particle. We 
will see later on that a similar situation exists in the real world for pion decay, 
in which it is less probable for the pion to decay to relatively light products 
than relatively heavy ones. 

'The reason for this strange result — contrary to what we saw in Chapter 
9 from general phase-space considerations — is that the coupling constant g 
has units of momentum (or alternatively of (mass) x (speed of light)) in the 
toy theory, whereas the analog of the coupling g for a real world theory is 
dimensionless. In the real world, it is the matrix element that provides the 
proper dimensionality to ensure that I has the correct units. This typically 
means that it introduces additional powers of the mass, forcing T ~ m and 
leading to the result that in the real world heavy particles typically decay 
with more rapidity than light ones. However, there are exceptions to this as 
the toy theory illustrates, and we will see in Chapter 21 that in the real world 
the pion is one of them. 


E: SeSe e 


10.3 Scattering in the Toy Theory 


Let’s compute the cross section for the elastic scattering process B + B — 
A+ A. The diagrams are (to lowest order in g — rule 7) given in figure 10.6, 
where we note that we combine the diagrams in all possible ways consistent 
with rule 6. T'he diagram on the left gives 
¡M1 = (10.8) 
(cio? | EL, O (Wh +a- pr) (LANE (0h — a — 02) 5 
—— (27)4 MEE eI o i EL q? —m2 
—— 


rule6 rule4 rule 3 rule 3 rule 2 


and the one on the right gives 


. S SD d'q 4 s (4) (pf 
ip eed | (2n)45 (p, +4- p) 


(27) 
i 
x(27)t8® (p, — q — p2) 53 (10.9) 
q — MB 
The integrations are easy, since? 
d'g 454) 
(27)! (21) 9^ (q — k) F(q) = F(k) (10.10) 


See the appendix on Dirac delta functions if you are rusty on this. 
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FIGURE 10.6 
Lowest order diagrams for B+ B — A+ A. 


Integrating over q we find that the first delta-function gives for M r 


Mr = (20450 (pf, + ph — pr — p2) 5 (10.11) 
(pi — pi) — mj 
and for Mi 
g? 
Mrr = Qn)*8? (pi + py — pi — pz) (10.12) 


Adding them together and using rule 5 gives 


1 1 
/ 2 


Mag 7 + 2. 
(p, =p) -mh (p-p) -m$ 


(10.13) 


for the lowest-order matrix element which describes this process. 

Of course we could have integrated over the second delta-function in eqs. 
(10.9) and (10.10). Had we done so, the intermediate steps would have dif- 
fered, but the end result (10.13) would have been the same. 


— ———n o: 


FIGURE 10.7 
B + B — A + A scattering picture. 


Now we can take |.M|^ and put this into the 2-Body cross-section we com- 
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puted last time. In the CM system 


E¡=E2=E, Pp =-Pp=Pp 


and energy conservation implies that E = E”. For simplicity, let's take m4 = 0 
and mg =m. Then 


2 2 
(pj — p1)” — mi = (ph) + (01) -29:m-m 
= 0-- m? — 2(E'E — p- B'cos0) — m? 


= —2E(E — |p| cos 0) (10.14) 
and by similar arithmetic 
(p, — p1)? — m3 = -2E (E + |p| cos 0) (10.15) 
do 


Putting this into the formula for 4 in the CM system gives 


do (y SIMP |p" 
( 


dQ 87 E, + Eje |l 
fie V, CIDE’ | 5 1 1 a 
= 482) 4182 |? | 25E(E-|picosó) 2E(E |p|cos0) 
1 h 2\ 2 3 
=> (= ) £ : (10.16) 
77 Elm (£? — pel? cos? 9) 


where S = 1/2 due to identical A's in the final state and I have inserted the 
correct factors of c in the last line. 


Let's pause to note some things about this formula. At high energies 


do (ho (eg* ^. (ho? (eg) 
dQ E®(1—cos? 0)? ES sint 9/2° 


rapidly with increasing energy (high-energy particles are less likely to interact) 
and with increasing angle (it’s more likely to scatter in a forward (or backward) 
direction than in a perpendicular one). Note also that the units are correct: gc 
has units of energy and fic has units of (energy) x (length), so the cross-section 
does indeed have units of area. 

For a massless A particle, ABB theory is a toy version of QED, with A 
playing the role of a spinless photon, and B playing the role of a spinless 
electron. The scattering process in this example is pair-annihilation, where 
two B particles destroy each other and become a pair of “photons.” We will 
encounter the real-world version of this process a bit later on, and we will see 
that the basic physics described above will be preserved. 


|pc| ~ E, and The cross-section falls off 
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10.4 Higher-Order Diagrams 


All we've looked at so far are “tree” diagrams that are lowest-order in g. For 
example each diagram in figure 10.6 is of order g?. Higher order diagrams 


FIGURE 10.8 
A lowest order diagram for BB annihilation. 


will contribute to this process; for example the diagrams in figure 10.9 all 


FIGURE 10.9 
Some higher order diagrams for BB annihilation. 
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contribute to order g^. — I've drawn the three distinct diagrams in which 
the added line starts on the lower-left B-line and connects to another line 
according to the rules. There are another three for the lower-right B-line, 
but we've already counted one of these (the diagram on the upper-right in 
10.9), so we get two more. Each of the final A-lines can have a B-bubble, 
and there is one more diagram in which an A-particle leaves and returns to 
the internal B-line (an A-bubble). So in all there are eight diagrams, plus 
another eight for the crossed version (where the final state A's have switched 
places — the diagram at the right in figure 10.6). Note that disconnected 
diagrams, those that can be separated into two distinct parts without cutting 
any line, don't count. 

So there are 16 diagrams to compute at the next order. Let's compute the 
diagram in figure 10.10 for which the internal B line has a bubble. There 


FIGURE 10.10 
An A-bubble diagram. 


are four internal lines, and so rule 4 gives four integrations. There are four 
propagators (rule 2) and four delta functions (rule 3). Hence we get 


(i y J d^qi f dtg / d*q3 J d*q4 i i 
Y J OTAJ CJ Quy J Cri- mi g- m, 
3 (27)*64) (pj +a — pı) (amis (qo + qa — qı) 


x 
qj — m$ qi— ME 
x (205% (qa — q — qs) (21)46(9 (ph — qu — pz) (10.17) 


The delta-functions make the integrations fairly easy to do. Integrating over qi 
means setting qı = p1— pj, and integrating over q4 means setting q4 = p5— po. 
'This gives 


(-ig)* ( ae B Y m i m B má J I d “a 
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d*q3 i As(4) / 

S (22)! qd — m3 (20) 0*? (q2 + q3 + py — m) 
3 B 

x (2m)*6 (ph — pa — qa — 93) (10.18) 


Integrating over q2 gives 


(27)*5“) (p! + ph — pı — p2) 


[0 =p)? - m3, 


4 
x | d qs : : (10.19) 


(27)* (p + qs — pi)? — m3, d$ — Mp 


where I have used the delta-function to set p, —p1 = p2 — ph to simplify things. 
Using rule 5 gives (dropping the redundant “3” index on q3): 


iM = y J d'q 1 NN 
(W — pu)? — m3)? J a - mà d? - m$ 
(10.20) 


where the dots refer to contributions from other diagrams. 

We see that we have one integration left to do. At this point we run into 
big trouble: the remaining integral is infinite. This is easy to see: there are 
four integrations to do but only four powers of q in the denominator. For 
large q we get 


d^ ? d x 35. 
La angular integration| X O a ^ In(q — oo (10.21 
2)2 
(a?) o q o q 


and we see that the integral diverges logarithmically. 

This situation is endemic to all quantum field theories: matrix elements 
are power series in small couplings (here g) with infinite coefficients. This 
was first realized in 1930 and remained an unsolved problem until Tomonaga 
(in 1946) [108] and Feynman and Schwinger (in 1947) [103, 109] proposed a 
solution called renormalization. 

How does renormalization work? Let’s parametrize the infinity by “cutting 
off” the integral for large q, i.e., let’s take 


A 
n a f in — -i f dq 
0 


where A is some large momentum and the factor of —i arises from analytic 
continuation!. Here q is a radial type of variable in a 4-dimensional space; 


'This is a technical point — it is easiest to do the integral if we set qo — —iq4 and then 
used methods from complex analysis to calculate the result. This procedure is called Wick 
rotation. 
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there will also be an angular integration, but it will just multiply the integral 
by a constant factor of 277. The rest of the integral gives 


M= (2) 20 D (+) | | (10.22) 


[o -p - m] 


where for simplicity l've taken mg = ma =m. Of course if A — oo this is 
still infinite. 

But now a miracle happens: by adding this to the first diagram of O (g) in 
the B+ B — A+ A process we computed above, we find that this In (A) 
term corrects the mass: 


rcr ui m : prit 


-+0 (9%) (10.23) 


er [me Gm] 


where the dots refer to finite terms. The second diagram in B+ B — A+A 
is similarly corrected by the crossed-version of the A-bubble diagram. So we 
interpret 


mBHYSICAL — mp + ómpg 
2 A 
ER 2 of sg 
= ms | ES (A) 
1 g y? A 


Other diagrams with infinite contributions all either correct the masses of 
the particles or the coupling g. So when we add all such infinities together 
we find 

MPHYSICAL = M + ÔM JPHYSICAL = 9 + ôg (10.25) 


The quantities óm and óg are infinite as A — oo, but that’s okay: we can't 
measure them anyway! All we can measure are mpquvsicAL (for A and B) 
and gpuvsicAL. These are finite by definition. So we can take account of the 
infinities by using mpyysicaL and gpnuvsicaAn in the diagrams instead of the 
original m's and g we started with (which themselves are assumed infinite to 
cancel the infinities in ôm and ôg). 

Of course this trickery doesn't come without a price. The price we pay is 
that the masses and coupling in our theory can never be predicted. They 
must be input from experiment. Perhaps this isn't too bad a price to pay: 
there are only 2 masses and 1 coupling in AB B-theory, so all we need are 3 
inputs. A renormalizable theory is one in which all infinites can be absorbed 
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by these kinds of redefinitions of a finite number of parameters. ABB 
theory is an example of a renormalizable theory. QED and QCD - indeed all 
gauge theories — have also been shown to be renormalizable. Renormalizable 
theories are very important since they have predictive power — they yield 
answers for decay rates and cross-sections that do not depend upon the cutoff. 

Non-renormalizable theories do not have this feature — they have an infinite 
number of infinities, and so yield meaningless results. It would be nice if we 
could just ignore these kinds of theories, but we can't — it turns out that our 
best theory of gravity — general relativity — is a non-renormalizable theory. 
This is one of the major problems of quantum gravity — the methods that 
yield predictive quantum theories for the non-gravitational forces of nature 
(described by gauge theories like QED or QCD) yield meaningless results 
when applied to gravity. 

What do the finite contributions do? They also correct ma, mp, and g 
by finite calculable amounts that are functions of the four-momenta of the 
external particles. This means the effective masses and couplings depend on 
these four momenta, i.e. on the energies of the particles involved. We call 
them “running” masses and “running” couplings. This dependence is slight, 
but it does have observable consequences in both QED (in the form of the 
Lamb shift) and in QCD (in the form of asymptotic freedom), as we shall see 
in subsequent chapters. 


10.5 Appendix: n-Dimensional Integration 


It is common in particle physics to compute integrals over spaces of different 
dimensionality. Most of the time these integrals come from loops that ap- 
pear in Feynman diagrams. These integrals are generalizations of the switch 
from Cartesian coordinates to polar coordinates in two dimensions, or from 
Cartesian coordinates to spherical coordinates in three dimensions. In gen- 
eral computing the integral involves transforming from Cartesian coordinates 
in n dimensions to generalized spherical coordinates in n dimensions. 
Recall that in two dimensions the transformation is 


xl =rcosé 


a? =rsind 


\ > dex = da! dz? = rdrdó (10.26) 


where I have written (x, y) = (z!,z?). The factor of r comes from computing 


the Jacobian of the transformation: 


se —rsin@ r cos 0 


Ox! Ox? : 
e e cos? sinó : 

se | &|- e| | rette rst 
80 ^ 80 


(10.27) 
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The procedure applies to any number of dimensions. We write (x1, x7,..., x”) 
for the Cartesian coordinates and (r, 03, ..., 0") for the angular coordinates. 
'The transformation between them is 


xl =rcos6! 

xz? = r sin 0! cos 6? 

: (10.28) 
z"—-1 = r sin 8! sin 0? ... cos 0"-! 

x" = rsin 0! sin 0? .. -sin 0"-! 


and every angle has a range from 0 to 7, except for 0"-! which has a range 
from 0 to 2z. The Jacobian is 


Or! da? Oz" 
Or Or. Or 
Or! Oc? ee 8r” 
01 01 01 = E =D y. n—3 : = 
det a E . mt =e (sin 9) (sin 9) sing?! 
dat Oz" 
Bam on 59-1 


and can be obtained by induction. So we have 


d?x = dx'dx? --- dx” 
= 01 (sing)? (sin?) ^ ... (sin9"-?) drd6--- der 
= rl drdQr-1 (10.30) 


where dQ”~! is shorthand notation for the angular part of the integral. 
For example, the three dimensional volume element is 


dx = daldz?da? = y? (sin 9) drd0* d0? = r? sin 0d6dó (10.31) 


where I have written the more familiar notation 0! = 0 and 0? = $ in the last 
term. 

How can we do the angular integration? There is a trick to this that makes 
use of Gaussian integrals and the Gamma function. Recall that a Gaussian 
integral gives 


e e" dz = Vn (10.32) 


— oo 


and that the Gamma function (the factorial function) is defined as 
T (n) =f qug gs (10.33) 
0 


Let's consider the integral 


xd —r? m-1 n—1 xd =z n—1 dz n—1 
e r^ "drdQ E e ^ (Vz -fa 
NI 0 ( ) 2/z 
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= 7 a | aon 
2 0 
1 n n-1 
st (2) fa (10.34) 


wher I have set r = 4/z in the first line on the right-hand side; the final line 

easily follows from the definition of T (n). Since r? = CON + (a2)? peces 
2 2 n 

(a^^, we have e”? = e [(2") H(z?) m y] _ Sey sey "pm 

the exponential of the sum is the product of the exponentials. Hence we can 

write 


E e n dd Qn} =| de... f dere- (9) e- (Y... e- e» 
0 as t 
=} ase Y f aste GP ss. f danny 


= (vm) (10.35) 


where the last line follows since each integral gives the same factor of yr. 
We can equate the two expressions to obtain 


II 


2 n 
Jue NUS (10.36) 


for the (n — 1)-dimensional angular integral. For n = 2 this gives f d0 = 
T = 2n, which is the circumference of a circle (the area of a 1-dimensional 


“sphere”) and for n = 3 this gives f d0 = WD = ST, which is the area of a 
2 


sphere of unit radius. 
Our 4-dimensional momentum integrals are typically of the form 


[I f «Fa = poa)? = lE- ra) 
A 


or in other words, a product of functions of relativistic invariants. The easiest 
way to do these integrals is to analytically continue the time components, 
so that qo — iq4, poA — ip44, a procedure called Wick rotation. One then 
has a 4-dimensional integral that can be carried out, and then final answer 
is obtained by Wick rotating back, so that paa — —ipo4. There are some 
mathematical properties the integrand has to obey in order for this procedure 
to be valid, but they will be satisfied for all the integrals we consider in this 
book. In the simplest case we have, after Wick rotating 


=i f ago fan 
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. 22? m 
TE l aag 


Qin? J di (10.37) 


I 


I 


where (3)? = (a) + (q’) j (43) + (4%) * is the “radial” momentum variable. 


10.6 Questions 
1. Consider ABB theory in which the A is massless. 
(a) Find the differential cross-section in the lab-frame for the reaction 
B+A—B+A 


What is the angular dependence of the cross-section at high energies? 
(b) Find the differential cross-section in the CMS for the reaction 


A+A—B+B 


FIGURE 10.11 
Vertex for ABC theory. 


2. Consider a generalization of ABB theory to ABC theory, whose vertex 
rule is given in figure 10.11 and in which the C is massless and the A 


and B have equal mass m. 
(a) Draw all allowed lowest-order diagrams for the reaction 


C+A—C+A 
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and find the matrix element and differential cross-section in the lab- 
frame to this order. 


(b) Draw all allowed lowest-order diagrams for the reaction 
C +C — 2B 


and find the matrix element and differential cross-section in the CMS 
to this order. 


. For the ABC theory of question #2: 


(a) Find the differential cross-section in the lab-frame for the reaction 


C+B—-C+B 


(b) Find the differential cross-section in the CMS for the reaction 


C+C— A+A 


. (a) Consider a process in the ABC theory of question #2 that has na 


external A-lines, ng external B-lines and nc external C-lines. Find 
a simple rule that shows which processes are allowed and which are 
forbidden in ABC theory. 


(b) Test your rule on the following processes. Draw a diagram for each 
allowed process. 
B C+C+C B+B—A+C+C 
A+C—B+B+B A+B+C—-A+B4+C 


. Consider a variant of the ABC theory in question #2, in which both B 


and C are massless and the A has mass m. 


(a) Find the differential cross-section in the lab-frame for the reaction 


A+CG—A+4+C 


(b) Find the differential cross-section in the CMS for the reaction 


C+C—-B+B 


. Consider the process A — B+ B in ABB theory. Draw all diagrams 


relevant for this process to order g?. 


. Consider a theory of three spinless particles A, B and C. Each particle 


is its own antiparticle. The C is massless and m4 > mg. The Feynman 
rules for the theory are in figure 10.12 


(a) What are the possible parities for each particle if parity is conserved? 
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FIGURE 10.12 
Vertices for question 7. 


(b) Under what circumstances can C = — 1 for the C-particle? 


(c) Which particles can be unstable in this theory? Under what condi- 
tions? 

(d) Compute the decay rate in terms of the masses and the coupling con- 
stant(s) for any one of the unstable particles that satisfy the conditions 
you find in part (c) to lowest order in the coupling(s). 


(e) Draw and label all diagrams which correct the process in part (c) to 
next-lowest order in the couplings. 


8. For the theory in question #7: 


(a) Consider scattering of a B particle with an A particle. What par- 
ticles can appear in the final state (to lowest order in the couplings)? 
Draw and label the diagram(s) associated with this process. 


(b) Find the matrix element for this process (i.e., for B -- A — your 
answer) to lowest order in gı and go. Simplify your answer in the lab 
system, where the A particle is at rest. What does it become in the 
limit that the incident energy of the B particle is large? 


(c) Draw two diagrams that are the next-order corrections to the process 
in part (b). Label only the names of the particles in each diagram. 
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Wave Equations for Elementary Particles 


DOI: 10.1201/9781420083002-11 


In order to compute matrix elements for real world theories, we need a better 
understanding of how to write down the incoming and outgoing states for 
actual physical particles. We saw in Chapter 5 that the group of spacetime 
symmetries — space-and-time translations, rotations, parity — have allowed us 
to classify particles according to some basic properties, which I have recapit- 
ulated in table 11.1. 


TABLE 11.1 
Particle Classification 
Space and time Translations Mass-energy (the rest mass, m) 
Rotations Intrinsic spin (spin s — 0, 2. Tias) 
Parity Intrinsic parity (P =+ 1) 
Charge Conjugation Intrinsic charge conjugation (C = + 1) 


However, we have not yet seen how relativity (i.e. boost covariance) aids in 
the classification. To do this, we'll need wave equations for the irreps we've 
developed so far (i.e. for wavefunctions of definite mass and spin) so that we 
can do quantum mechanics, compute matrix elements, and make predictions 
for decay rates and cross sections. 

Clearly we need to go beyond the free-particle Schroedinger equation 


o mv? 


c gc HY = 
‘ Ot 2m 


y (11.1) 


because it is not relativistic: space and time are treated differently. However, 
we can use it to get a hint of how to find relativistic wave equations by noting 
that its solution is a plane wave 


Pp |p 
=“=ÉéL (1.2) 
2m 2m 


V = Yo exp [-i (Et — p- x) /fi] where E- 


where we interpret E as the energy and p as the momentum of the particle. 
The only thing non-relativistic about the solution V is the relationship be- 


12 
tween E and p, namely that E = Br. In general, for stationary states in 
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which HW = EW, we have 


o 

ifi = EV > Y = Ẹ (2) exp [-iEt/h] (11.3) 
in other words the time derivative of the wave function gives us a constant 
that we interpret as the energy. Note that the minus sign in the exponent of 
the plane-wave solution is crucial because it ensures that the energy of the 
particle is positive. Had we written Y = Wo exp [+7 (Et — p - x) /h], we would 

2 

have obtained E = oi i.e., this would correspond to a particle of negative 
kinetic energy, which does not solve the free-particle Schroedinger equation 
(11.1) above. 


11.1 Klein-Gordon Equation 


From now on let's set A = c = 1 for simplicity. The preceding approach 
suggests a guess that a relativistic free particle has the wavefunction 


$ = do exp [- i (Et — p- x)] = ġo exp [-ip,uz"] (11.4) 
where now we require E=\/p-pt+tm? and p"-—(E,p) 
and the sign is chosen in accord with the non-relativistic case. We can find 
the equation that $ obeys by differentiating it twice with respect to x 
2 3 2 2 2 o 
V?$ = idg.Vó—(-P::)ó—(m*—E?)ó-m P+ m 


82 
=> (s v") o nto o (11.5) 


which we can also write as 


(9,0% + m?) 6 =0 (11.6) 


an equation called the Klein-Gordon equation. Here 0, = (& V) = £2. is 
the relativistic gradient, with t = z?. Note that 
o 0 00 00 00 0 0 a? 
0,0" = gi” = (+1) = v? 
Ox" Oz" OtOt  OrOx  OyOy Ozdz OC 
(11.7) 
Since the operator V? is invariant under rotations, so is 0,,O", and so the 


rotation properties of ¢ must be trivial — in other words ¢ is a scalar (i.e. it has 
spin-0). We interpret p" = (E,p) as the 4-momentum of the particle whose 
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wavefunction is 9. Note that for small p, we have E = yp- p+ m? ~ m+ Ig 
and the solution reduces (up to a phase) to the solution for the free-particle 
Schroedinger equation. 

The Klein-Gordon equation (or KG equation) was the first attempt* at 
developing a relativistic version of Schroedinger's equation (and was actually 
first proposed by Schroedinger [110]!). It is relativistically invariant, since 


o Oz" 0 


IM AM SU aem = 
qe Nx duc Or dau 


= A“ „ð, > 0,0'" — 8,0" (118) 


and so 
(9,0% + m?) $ (a^) = (8,0" + m?) olz) =0 (11.9) 
This is nice. However, there is an awkward snag — unlike the Schroedinger 
equation above, the KG equation has another solution 


$ = bo exp [+ip,2"] = do exp [+i (Et — p Z)] = o exp [~i (- Et + P: z)] 
(11.10) 
where the 4-momentum of the particle is now p^ = (—E,—p). The time- 
derivative of à is easily seen to be ¡2 = ES = —vVD:P+ m?o in other 
words the particle has negative energy! Of course this is what we expect 
from a 2nd order differential equation, namely that there are two independent 
solutions. Inserting either solution into the KG equation gives 


(9,0% + m?) é(z) = — (p? — m?) d(x) = 0 
(9,0 + m?) (z) = — (p? — m?) d(x) =0 


This other solution was quite puzzling to theorists in the 1920s. Not only 
were there negative energy solutions, but the probability density associated 
with the equation could be shown to be positive for positive-energy states and 
negative for negative energy states! Even worse, the equation was not able 
to incorporate the newly discovered property of electron spin. It turned out 
that these problems were not occupying too much attention in the physics 
community at that time because shortly afterward Dirac came out with a 
different equation that incorporated relativity into quantum mechanics. 


11.2 Dirac Equation 


The problem with the negative energy solution appears to arise because there 
are two time derivatives in the KG equation (this is what gives p? — m? = 0, 


*In 1926 Pauli wrote in a letter to Schroedinger that he didn't “believe that the relativisitic 
equation of 2nd order with the many fathers corresponds to reality.” The Klein-Gordon 
equation has a rather interesting history [111], involving many people in its early develop- 
ment. 
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which has two solutions for the energy). If we wanted an equation that had 
only one time derivative, then it must have only one space derivative as well. 
So we might guess 

(að +m) V 20 (11.11) 
Note that if a^ is just a 4-vector, then for a plane wave this equation gives 
ia p, + m = 0, which is the wrong relation between 4-momentum and mass. 
'This is not what we want. However, if V were a multi-component wavefunction 
(with components Y), then maybe we could get the right relationship if a^ 
were a matrix. Let's call this matrix ^" — kind of like a 4-vector, but with 
each entry a matrix. 

So let's guess (summing over repeated indices!) 


(i028, — m8?) wy = o 


suppressing 


or (iy"9, —m)p =0 (11.12) 


matrix indices 


Note that we write mI = m, where I is the identity matrix (whose components 
are 07). At this point we don’t know the dimensionality of either the identity 
or of the (4)? (i.e., of the range of the indices a, b), but we will soon find out 
what this must be. 

We need to be sure that we can recover the usual relationship p? — m? = 0 
between the 4-momentum p^ and the mass of the multicomponent wavefunc- 
tion 4. If 4 had no spin, it would behave like a KG field, i.e. like e^ !?*. Let's 
guess that something similar happens here, and set 


ia = tie (per F* (11.13) 


where u4(p) is some N-component “column matrix” (recall that we don't yet 
know the value of N; all we know so far is that a = 1,2,...,N) that isa 
function of the 4-momentum. Our proposed equation becomes 


0 = (i940, — m) y = e '** (yp, — m) u(p) > ("pu — m) u(p) = 0 
(11.14) 
and so we have a constraint on the N-component object u(p). Let's multiply 
eq. (11.14) by the matrix (y“p, + m): 
0 = ("py + m) ("Pu — m) u(p) 
= (pr) (pu) + M (pu) — m (pr) — m?) u(p) 
= (“Y pupy — m?) u(p) (11.15) 
where in the second line I have put brackets around the objects that are 
matrices. The set of manipulations involved in getting the last line is 
v v c b py € b 
(Po) (Pp) = (Pda Q" pu), — (a Pu (We Pu 
v\ € b Cy vy b 
= (a (1%). PuPu = 0"), (1). PuPu 
NN Sige AA C dT Duy (11.16) 
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which follows from a recognition that the y’s are matrices (and so don't 
commute) but the p,,’s aren't (and so do commute). 

This looks like an extra constraint on the column matrix u(p). If we don't 
want this to be an extra constraint then we must require it to hold because 
p? — m? = 0 and not for any other reason. Hence we must constrain the 


“y-matrices to obey 
b 
("a Ve pup, = mà, = (p — m*) fa = 0 
or alternatively 4"^"p,p, = p? = g""pyp, (11.17) 


As noted previously the ^4/'s are matrices and the p,,’s are not, so we can 
write 


1 
YY Pudo = ¿Y (PuPv + PuPu) 


1 1 
= 3 Y PuPu + 31 Y PuPu 


1 v V 
5 OPY t Y") pup» (11.18) 


where the last line follows from relabeling the indices. In order to ensure that 
"y" Duby = p? we must have 


1 v v V 
5 (MV HWY") Pupo = 9" pup» (11.19) 


or alternatively, since this must hold for any p, : 
(nl p a y eager (11.20) 


where I is the identity matrix. 
The preceding relation provides a very important constraint on the y” ma- 
trices. We can write it as 


{yy} = 29"" (11.21) 


where {y#,7”} = (q4" + y 4") is the anticommutator. For any two opera- 
tors or matrices, the anticommutator is defined as 


{A,B} =AB+BA (11.22) 
So any set of 4 y-matrices that obey eq. (11.21) will guarantee that p? — m? = 
0 for the y wavefunction without imposing any further constraints on y. The 
simplest solution! to (11.21) is when N = 4, i.e., the y-matrices are 4 x 4: 


tHow do we know that this is the simplest solution? It’s done by brute force. The matrices 
must be at least 2-dimensional, and must obey (49)? L-I--— (my =- (my =- (43)? 
as well as anticommute with each other. The three 2x2 Pauli matrices do this, but we 
need a 4th matrix (the 5 matrix) that also does this, so 2x2 (N = 2) is too small. You can 
show that there are no 3x3 solutions. The next simplest thing is the 4x4 case shown here. 
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0 I0 i 0 c? 


: 1) is the 2 x 2 identity matrix and the o^ are the 3 Pauli 
matrices. Note that we can obtain many other 4 x 4 solutions to (11.21) by 
writing 5" = U'y#U where U is a 4 x 4 unitary matrix. It doesn't matter 
which 4-matrices you use as long as they satisfy (11.21). 

So we see that since the y-matrices are 4 x 4, the wavefunction ~ must be 
a 4-component wavefunction v, and that it obeys the equation 


where I = ( 


(i58, — m)v =0 (11.24) 


which we now call the Dirac equation, proposed by Dirac [112] in 1927 as 
a relativistic generalization of the Schroedinger equation. Any wavefunction 
that obeys this equation automatically satisfies the KG equation (but not the 
converse!) since all its solutions will have p? — m? — 0. In this sense the Dirac 
equation is like the “square-root” of the Klein-Gordon equation. 

Note that, although w has four components it is not a 4-vector, in other 
words it does not transform under a Lorentz transformation the way the a 
4-vector does. That's why I used Latin indices a,b to label the components 


of v. 


11.3 Physical Interpretation 


So what is v? Let's look at the solutions to the Dirac equation to see if we 
can find out. The block-structure of the y-matrices suggests that it might be 
useful to write 


Y = u(p)e '"* = B pp (11.25) 


where € and x are each 2-component objects. Recalling that p, = (E,—p), 
and using “0, = Y i$ +7. V, the Dirac equation (11.24) then can be 
decomposed as follows: 


ides) une) —m gem) =0 
= eio? [i (iE + i p) (u(p)) — m (u(p))] = 0 


ox E—m-p-:o £ 
ip-x = 
>e Fr EUN b) =0 (11.26) 
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This equation breaks up into two parts 


(E-m)&- (9-3)x=0 
-(E4m)xG-3)6-0 


p.c VN um 
>y= = -0 11.27 
box uto ues 6 0t aua) 
where I used the lower equation to determine y in terms of €. Inserting this 
solution into the first equation just gives 


(a) , Ema 


li ec rs ee a EORR 


£-0 

(11.28) 
since (f. 8)? = p- p, and E? = m? + |p|’. Consequently the components of 
€ are not determined, and we have 


E y g En 
i — 8t(p d oe e IP? — N(p zm eg ive 
(p) aes Ê- 6)6 (p) V Bre (D-8)6 
(11.29) 
where 9t(p) is an undetermined normalization. Let's choose it to eliminate 


the energy term in the denominator of 4, so that 9t(p) = vov E + m, giving 


E+m 
Y= Vov2m | Y 2 -— ere (11.30) 
2m pio 


where wy is just an arbitrary constant. 
Now notice something. Suppose we are in the rest frame of y, where p = 0. 
In this case E — m and we have 


Y (B= 0) = vov2m a ev (11.31) 


and so there are really only two independent components of w, namely the 
two components of €! We can normalize these so that 


EM = (5) audet (1) (11.32) 


which means that we actually have two independent solutions to the Dirac 
equation 


B+ e) 
y (2) = u (a) = yo v2m | Y 2" _ jee (11.33) 
Ee (p a) E0 


corresponding to each of these possibilities, with i =f, |. 
Dirac interpreted these solutions to correspond to the spin-up electron and 
the spin-down electron — something I’ve anticipated in the notation above. 
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The solution with £(? corresponds to a plane-wave solution of a particle whose 
spin is parallel with respect to the f-axis, and the solution with £() corre- 
sponds to a plane-wave solution of a particle whose spin is anti-parallel with 
respect to the p-axis. Non-relativistically we can ignore the lower two compo- 
nents of w, as the above derivation shows. Indeed, since non-relativistically 
there are only two independent solutions, the wavefunction 4 must correspond 
to a spin-1/2 particle, because this is the only representation of the rotation 
group that has two components! 


11.4 Antiparticles 


Hence the solutions to the Dirac equation are those of a relativistic particle 
of spin-1/2: a relativistic spinor! Superfically it appears that the lower two 
components have no physical content, but Dirac soon realized this was not 
correct. Since the Dirac equation is a 4-component coupled first-order differ- 
ential equation, it must have 4 independent solutions. The natural thing to 
do to find these other solutions is to write 


Y = v(p)e*!P* = E pus (11.34) 


and see what happens. Using the same calculational approach as before, we 
find 


(Et m)& —(p-o)x— EE CO Ø anb 
7 (E-mx- (p cops (Et m) Eam IX 


where we now write € in terms of x so as to have a well-defined non-relativistic 
limit. Hence 


E-m (5.2 ¢(i) 
A : z 4/ p:c)€ . 
Qj = v0 (x) = dg v/2m zm. (P 2) eina (11.36) 
jE km eli) 
2m 

are two other solutions to the Dirac equation. Now the non-relativistic limit 

(E — m) removes the top two components instead of the bottom two! 
Note that the e?” factor is just like the negative-energy solution to the 
KG equation — and so we expect that the v(p) solution is a negative-energy 


spin-1/2 plane wave. It looks like we have not cured the problem that we 
encountered in the spin-0 case after all. 
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Dirac thought that perhaps the v(p) solutions could be eliminated from 
the theory?. Since the solutions to the Dirac equation have spin-1/2, they 
should obey the Pauli principle: there can only be one particle of a given 
spin per state. So Dirac hypothesized that all states corresponding to the 
v(p) solutions were filled: there was a completely filled sea of negative energy 
electron states! 

Weird as this might sound, it offered a reason as to why we don't observe 
the negative energy solutions — they are all occupied, and so do not physically 
appear in any experiment ever carried out?. However, Dirac also realized that 
if one of these states ever received an energy larger than 2mc?, the electron in 
that state would “appear” as a positive energy particle, since it was excited out 
of the occupied sea. It would also be accompanied by a hole in the sea that had 
the same energy but opposite spin, momentum, and charge as the electron. 
The hole was interpreted as a positively charged electron: the positron [113]. 
Hence the relativistic generalization of a spinor led to a remarkable prediction: 


Filled 


FIGURE 11.1 

Schematic Diagram of the Dirac sea. Energy (represented by the lightning 
bolt) can excite a state in the filled sea up into the empty region. We observe 
this state as an electron; the hole left in the sea is observed as a positron. 


the existence of antimatter! Positrons — the antiparticles of electrons — were 
first observed in 1933 by Anderson [84], confirming Dirac's prediction. 


tDirac originally wanted to interpret these other two solutions as corresponding to the two 
spin states of the proton. However, because of the e*P"? factor, these solutions appear to 
have negative energy (as did the e*P'? solution to the Klein-Gordon equation) and so this 
can't be right. Furthermore, this would force the mass of the proton to equal that of the 
electron, which is also experimentally unacceptable. 

$Note that we can't use the same reasoning for the KG equation because its solutions don't 
obey the Pauli principle. 


210 An Introduction to Particle Physics and the Standard Model 


Nowadays we don't think in terms of a filled sea — we regard the e~’?’” solu- 
tions as corresponding to particles and the e+’? solutions as corresponding to 
antiparticles. This approach means that we can also make this interpretation 
for the KG solutions as well. Indeed the KG equation is valid for all spin-0 
particles and their antiparticles; the Dirac equation is valid for all spin-1/2 
particles and their antiparticles*. 

Antiparticles also have another interpretation. Notice under time reversal 
that 


T (exp [+ip,2"]) = T (exp [i (Et + p- £))) 
= exp [$ (Et - g- ) 
= exp [- ip, z"] (11.37) 


which is like a positive energy solution! This was Feynman’s interpreta- 
tion of the negative energy states: an antiparticle moving forward in time is 
equivalent to a negative energy solution moving backward in time! 

'The Lorentz-transformation properties of the relativistic scalar are quite 
simple, and we looked at them above. However, the Lorentz-transformation 
properties of the relativistic spinor are quite complicated. The details don't 
matter much for the remainder of the text, and so I’ve included these prop- 
erties in an Appendix. 


11.5 Appendix: The Lorentz Group and Its Represen- 
tations 

In order to see how spinors — or any other relativistic wavefunctions — trans- 

form, we'll need to look at the underlying symmetry group. This group is 

the Lorentz group, the group of all matrices A" ,, which obey 


gu, A" oA" a = gag (11.38) 


Since this is similar to rotations (RTR = I), let's proceed the way we did in 
Chapter 3. In that case we wrote R (0) = exp EJ : J| , and then used closure 


under small rotations to get the commutation relations for the rotation group, 
whose generators are the 3 J’s. In a similar manner, Lorentz transformations 
are either of the form 


A. (11.39) 


This interpretation of particle and antiparticle wavefunctions properly requires the con- 
ceptual and calculational foundations of quantum field theory [97]. 
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10 0 0 1.0 0 0 1 0 0 0 
B 01 0 0 0 cos 0,, 0 — sin 0, 0 cos0, sin0,0 
rh 00 cos@é, sind, |’|0 0 1 0 ’ | 0 — sin 8, cos0, 0 
0 0 — sin 0, cos 6, 0 sinó, 0 cos0, 0 0 0 1 


or, for boosts 


M got (11.40) 
Ye —BrYz 00 Vy 0 — By Yy 0 Ye 00-6272 

— —Boye. “Ye 00 Or SE 0 0 0 10 0 

F 0 0 10|'|[-5,40 wy Of? 0 01 0 
0 0 01 0 0 0 1 —BeYz 00 yz 


where Yyy = 1/4/1 — 62 etc. 

We write 0 = 0 to denote a rotation of angle 0 about an axis pointing in 
the direction of the unit vector ù, and 8 = Bn to denote a boost to velocity 
Bc along the f direction. For small |6| and || we obtain the generators 


tU (11.41) 
0000 000 0 0 0 00 
4 [0000| ;[000-1| |0010 
= oor Teo? |o000 175 |o0-100 
00-10 010 0 0 0 00 
PER (11.42) 
0 -100 0 0-10 0 00-1 
_)_,{-1 000] ;[0000| ;[0000 
= o 000|v*|—1000]' 0 00 0 
0 000 0000 -100 0 


which can easily be shown to give the commutation relations 
Ea | =iF* + cyclic 
[es | = ij^ + cyclic (11.43) 
Ee Ko] —iK^ + cyclic 


The first part is the familiar set of commutation relations from the rotation 
group SO(3). The rest of the commutation relations indicate how the boost 


generators K! interact with the rest of the group. 
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(z;.vV) ey ^ [aut] dxo = (x) "vy 109994 
(zy) h a Lanse | dxo = (2) 1ourdg 
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uorjeuriopsued], [M4 So[ourutoT, 
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The algebra of the Lorentz group is a bit awkward written in this way. We 
can clean things up by writing 


J” = -3P = iki gu = gk (11.44) 
so that 
[e y`] — (gò _ gg zie ge = gay) (11.45) 


which embodies all the relations between the T 's and K’s above. 
We now regard any solution to (11.45) as a valid way to represent a Lorentz 
transformation. The most general solution is 


(qu spes ey (11.46) 


where £*” is the orbital part and G*” the spin part of the Lorentz generators. 
Analogous to the rotation group we have 


LH” = i (z^ 0" — z"0") (11.47) 


and it is not too hard to show that £"" is a solution to (11.45). In the 
rest-frame of a particle, this will vanish when acting on any wavefunction; all 
that will remain is the spin-operator 6*” acting on the wavefunction. 

Hence we classify particles by their spin-values — i.e. by the possible G"'s 
that are irreducible solutions to (11.45). Finding all of these is an exercise 
in group theory beyond the scope of this text. The answer turns out to be 
analogous to what happened with rotations, as shown in table 11.2. 

The matrix X^" = 2 ([y",7"]), and the associated tensor wj, is a parameter 
matrix; it’s analogous to the g parameters in rotations. In fact, by comparing 
the Lorentz rotations to the standard rotations, we find 


Bj 


The quantities 0; are called the rotation angles and the quantities 7; are called 
the rapidity parameters. Note also that the Lorentz and Dirac indices are quite 
distinct from each other, even though both take on 4 different values. This is 
a coincidence that holds only in 4 spacetime dimensions!. And don't confuse 


Wij = CijkÜk and sinh Woj = sinh Nj = Bidi = (11.48) 


the relativistic yj = (1 — ay * with the Dirac gamma matrices — they are 
mathematically distinct quantities that for reasons of historical accident make 
use of the same symbol. 


'The range of a Lorentz index is from 0 to d— 1 in d spacetime dimensions, whereas the 
range of a Dirac index is from 0 to 214/2] where [d/2] is the integer just less than d/2. Hence 
a spinor in 11 dimensions has 32 components, but a vector has only 11. 
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Let's look a little more closely at the spinor transformation matrices. We 
have 


EE i i (0 o 
s = i (er =i (0 (11.49) 


y. ‘ (by) = 2: e " zs m a Mx) (11.50) 


So for rotations 


i —— „ ifo 
Wuv = Lu) = dag, = - > 
ae 2-1 ge a 0 js) 
E exp (39 5) o 
=> exp ES = s 5 (11.51) 
2 0 — exp (50-2) 


and for boosts 


gu EP" = iwoZ” = im ne = 


7 
0 
: 1 0 7:56 
=> exp (ig X9) — exp ls i^) 
0 
e 


Seah (2) 4$ ( 3) gh e (11.52) 


where the last line follows by taking the power series of the exponential func- 
tion and noting that 


NI = 
TAS 
=3¡0 
Al 

aA 
NY 


= [Rf Laxa (11.53) 


where I4,4 is the 4 x 4 identity matrix. 
Consider a spinor of the form 
5 ) 
vet 


Under rotations it transforms as 


(11.54) 
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and so we see that rotations do not mix up the upper and lower components. 
'This is consistent with our non-relativistic experience that an electron wave- 
function (which would have only the upper two components) can always be 
rotated into a form that is fully spin-up or fully spin-down. 

E= 272, then we can easily show that 4” = —w, which is what a spinor 
must do under a 27 rotation as we saw in Chapter 5 (see eq. (5.18)). Under 
a boost it transforms as 


1/0 Fee 
!— = 
ol (a0) 
cosh ( Hl) ¢4+9-é@sinh (5) o 
= is m (11.55) 
cosh ( 2- J o f): &sinh( Z JC 


and so we see that boosts do mix up the upper and lower components. Hence 
a moving spinor is a linear combination of electron-like and positron-like wave- 
functions. 

One other issue — what exactly is ^"? Is it a 4-vector or a matrix? Actually 
it's a bit of both. Using the relationships above it is possible to show that 


STIPS = A^" (11.56) 


where exp [Sw X^] = S is the Lorentz-transformation matrix for a spinor 
under a Lorentz transformation z'" = A" g”. Using this it is not too hard to 
show the following relationships 


cu = 
py = oy 
==) = 
byl = det (A) (v? v) 
V yy! = Ah, (dp) 
—/ I 
Y "yy! = det (A) At, (uy ^v) 
where y? = i9 1474? is the product of all the y-matrices, and y = yy". We 
see that v transforms as a scalar, but yï% transforms as a pseudoscalar, 


since det (A) — —1 under a reflection. Similarly, py” has the properties of 
a 4-vector and wy”? the properties of a 4-pseudovector. 


11.57 
11.58 
11.59 


( 
( 
( 
(11.60 


) 
) 
) 
) 


11.6 Questions 
1. Show for Dirac spinors u(p) and v(p) that 


Noa = (p m), and Y v (vi) = (p — m),, 


=P, i-t 
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2. Find spinors u that satisfy (7p, — m) u(p) = 0 that have positive en- 


ergy and that are eigenstates of the operator p - S where is the unit 
vector of the 3-momentum of the spinor and 


>- Ha Af 
8- 32-5 (02) 


is the spin angular momentum operator. 


. (a) Write the Dirac equation in Hamiltonian form by isolating the time- 


derivative of the spinor y on the left-hand side of the equation. The 
Hamiltonian H will be the operator on the right-hand side of the equa- 
tion. What is H? 

(b) Find the commutator of H with the orbital angular momentum 
operator i 


(c) Find the commutator of H with the spin angular momentum oper- 
ator S. 

(d) Find the commutator of H with the total angular momentum oper- 
ator J= L+ S. 

(e) Show that all spinors are eigenstates of $ - J = S?. Since S? = 
h2s (s +1), what is the value of s for a Dirac spinor? 


4. Prove the following 


Tr[/] 24 and Tr y" 4^] = 4g"" 

Tr [odd de of y-matrices] = 
Tr [yyy 9yP] = is HE sce e LS EG gea) 

Tr [y y Boy yo e] = epe with <” = —1 


Ya Y Y^ = —2y" 
Nay yy = 4g” 
Tr ly" | =0 


where e#” is the 4-dimensional Levi-Civita symbol (or epsilon-tensor) 
which obeys 


gs = 1 £0123 = +1 
uvaB _ J —1 (if uva B is an even permutation of 0123) 
3 — | +1 (if uva is an odd permutation of 0123) 
_ f +1 (if uva is an even permutation of 0123) 
SPP al (if uva is an odd permutation of 0123) 


5. Writing exp [4 W yy 3" ]= S, show 


STIS = Ay" 
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where 2’# = Ax” is a Lorentz-transformation. (Hint: use exp(z) = 
limy (14215. 
6. Verify the following Lorentz transformation properties: 
DY = V 
Uy = det (A) byw 
Dipl = Muy 
J^! = det (A) AM Py’ yah 
for a Dirac spinor v. 
7. Charge-conjugation transforms a spinor w according to the relation 
CY = iy 


where the * denotes the complex-conjugate. Compute the charge-conjugates 
of v(x) and v(x). How do they compare to uU (x) and ut) (a)? 


12 
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Perhaps the most powerful symmetry principle in physics associated with the 
non-gravitational interactions is gauge invariance. It forms the foundation 
of our understanding of the Standard Model and all of its generalizations. 
Its origins emerged in the 1820s with the discovery of electromagnetism and 
the first theory of electrodynamics. Over a period of several decades of ex- 
perimental study and theoretical refinement, physicists realized that different 
forms of the vector potential result in the same observable forces. James Clerk 
Maxwell [114] formulated the equations of electromagnetism that embody the 
first known gauge principle, though the nomenclature “gauge” was not used 
then and the equations were written in a rather obscure way that many peo- 
ple found hard to understand. The quest to understand relativistic quantum 
mechanics in 1926 led Klein [115] to formulate the Klein-Gordon equation in 
such a way that Fock [116] discovered it was invariant with respect to mul- 
tiplication of the wave function by a phase factor that depended on position 
and location, provided one incorporated the vector potential in a suitable way 
[117]. 

Hermann Weyl [118] declared this invariance as a general principle and 
called it Eichinvarianz in German and gauge invariance in English. In this 
chapter I will introduce you to this principle in its simplest context. 


12.1 Solutions to the Dirac Equation 
We have seen that the Dirac equation 
(iy*0, — m) v =0 (12.1) 
has the following complete set of solutions 
leechers) ist 
(p — m) u(p) = 0 (f + m) v(p) 2 0 (12.2) 
l E+m ¢(i) l E-m (6. 3) ¢(i) 
up) = vim | VE (p) = vim | V 2-0 0 


V Ame (9-0) £0 p 
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where for convenience I have employed the “slash” notation, in which any 
4-vector that is multiplied by a y-matrix is written with a slash through it: 
for example, p = y"p,, and = ðn. 

If we wanted to obtain quantum probabilities from these solutions, we might 
expect that the relevant quantity to compute is 414. Let's try this for a 
specific u-type solution that is, say, spin-up. We find 


Poy 
= ul) (pu (p) 


= (vov) ( Bam e(t)t , /B=m¢(1)t (9-4) ) 2m 


= om? [ez ¿DIED y (55 EDI (p. 2) @-a) en] 


amos (22) + (52) 


= 2Eve (12.3) 


which is not Lorentz invariant! The final answer depends on the energy, whose 
value depends upon the (boosted) frame of reference, and so the probability 
will depend on the frame as well. 

Clearly wy is not the correct thing to compute. But what is? How can we 
fix this problem? 

Notice that if the two factors in the second-last line were subtracted, then 
we would have a Lorentz-invariant quantity. This means that we need the 
lower two components to subtract instead of add. We can arrange for this to 
happen by defining the conjugate of y) to be = y Ty". In this case we find 


By” 
= UY (pju (p) 
= V2m | Ecm e(t , | E-me(Dt (9. gt ( ) M. cars 
(vo m) 2m 5 es (pian) 0-I (52 6-8) EM) 


; EE EGLI 
- (svi) (ee Haren o) (VE 


= = (p- a) EM) 


iod [ez ¿DEN — (55 et (5. 2) Ga) e| 


= 2mve (12.4) 


which is Lorentz-invariant! 
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The idea that the conjugate of y is not just its complex-conjugate transpose 
(i.e. the dagger) may seem strange, but it actually is what we need to make 
sense of the conjugate of the Dirac equation. Taking its dagger, we find 


0 = ((i5"8, — m) y)! = 40,01 (3)! — my! (12.5) 
which doesn't look much like the conjugate of the Dirac equation. However, 
using the relation* (y)? = 494/49 (and the fact that (7°)? = 1) we get 

0 = (6110, — m) y)! = 49,9 (y^)! — molt = — (ibp + mip) Y 
= v (ið +m) =0 (12.6) 
where the symbol a , means that the derivative operator acts on objects to 
its left. 


So it is v that obeys the conjugate Dirac equation, and we say that w is 
the adjoint of v. 


12.2 Conserved Current 


How then do we interpret yty? Well, we can write yty = yw, which 
looks like the 0-th component of a 4-vector. Indeed, from eq. (11.59) in 
Chapter 11, we know that 4y*4 has the transformation properties of a 4- 
vector. Furthermore, the Dirac equation implies that 


9, (Uy v) = (907) (0) + bd = imi — imi = 0 (12.7) 


which means that Yy” is conserved! Suppose we set yty x p and vip ox 
. We then find that 


0 = 8, (9^) = do (sty) +9 - (976) e LL V7. a28) 


which is just like the conservation of an electric current J” = (p, Z) 


Op 


apt VS 20e 8,7  —0 (12.9) 


In other words, the Dirac equation automatically has a conserved current 


prep! 
Indeed, if we assume that the wavefunction y is that of an electron, it 
should have a charge (—e), and it is tempting to interpret p = —epty as its 


* This is easily proved by brute force. Just check it for each value of u. 
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charge density and ut =-e (pty) = —ew as its electric current. Dirac 
thought that this was correct, and suggested using it in Maxwell’s equations 


V-B=p=-epty (12.10) 


OE gy BEG cia (12.11) 


as the current and charge density. 


12.3 The Gauge Principle 


The presence of a conserved current indicates the presence of a symmetry. 
Let’s see what that symmetry is. 

If we are going to use the 4-current —epy”y as a source for Maxwell's equa- 
tions, then we should also modify the Dirac equation; otherwise the Dirac 
wavefunction will influence the evolution of the electromagnetic field (it pro- 
vides a source for the field), but the electromagnetic field won't affect the 
evolution of the electron (which it must, from observation). 

How can we do this? Consider the phase of y(x). The charge and current 
densities remain unchanged if we redefine y(x) — e'^wv(x), where a is some 
constant. This is as we expect — the diffraction pattern of an electron beam is 
insensitive to phase changes. However, we actually have more: we would find 
that p and J would remain the same even if a = a(x) — even if we changed 
the phase differently at every point in space and time! Can we impose this 
much more powerful symmetry? 

At first, it looks like it might be hard to do this. Under the transformation 
W(x) > (x) = e (x) we find that the Dirac equation changes 


(^8, — m) Y! = e'*€ (iy^, — m) 9 — y” (8,0) V] (12.12) 


because we pick up derivatives of a. Since we want this more powerful 
invariance to hold (i.e., we want a = a(x)), we must invent a new kind of 
derivative operator D,, such that (Duy = eie) p, y. For this to work, we 
will need to introduce another 4-vector — let's call it A, (x) — that ensures this 
transformation property. This 4-vector — referred to as a vector field, since it 
depends on space and time — has to compensate for the 0,,a term. So let's 
define 

D, V = 0, V + ieA, V (12.13) 


which can be valid for any wavefunction Y (and not just a Dirac wavefunction). 
This new derivative operator will cancel out the a-derivative provided we 
require the transformation to be 


W'(z) =e (xz) and A), = A, — aya (12.14) 
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so that 


i 1 ; 
(D, V) = gio) On +1 (ô a) W + ie (4, — D | = ¿Un DU 


(12.15) 
This transformation is called a gauge transformation. If o is constant then 
we call it a global gauge transformation, whereas if o is a function we call it a 
local gauge transformation’. Note that the group of transformations depends 
upon one continuous parameter a. From our discussion on Lie Groups in 
Chapter 3 (recall table 3.3) we know that this group of transformations is 
U(1): the set of transformations of unitary 1 x 1 matrices (i.e. of complex 
phases). The derivative operator D,, is called a gauge covariant derivative 
because it “co-varies” along with the gauge transformation (i.e. it transforms 
the same way that Y does). The object A, must be a 4-vector by Lorentz 
covariance, and is in general a function of space and time. 
So the locally gauge invariant Dirac equation is 


(iy D,- m) =0 (12.16) 


which couples the wavefunction y to Aj. 


12.4 The Maxwell-Dirac Equations 


Now we need an equation for A,, to obey — one that is gauge invariant, Lorentz- 
covariant and (we hope) simple. We could try 


8" A, =0 (12.17) 


which is simple, but not gauge invariant. The next-simplest thing to try is 
something with two derivatives*. So let's try 


a0,0" A, + b0,0" A, = 0 (12.18) 


where a and b are constants, and demand that it be gauge-invariant: 


0 = a0,0" Al, -- 50,0" Al, = að ð" A, +b0,0" A, — T (ast) (8,0^8,o) (12.19) 


+The terms local and global can be understood like this. Consider a cubic box filled with 
billiard balls all of the same color that are closely packed. Rotation of this box about any 
axis through its center by a 90? angle leaves the box looking the same. This is a global 
transformation because we have done the same thing to every ball in the box. However, 
if every ball is individually rotated by any amount about any axis (while remaining inside 
the box of course) then this operation also leaves the box looking the same. This is a local 
transformation — we have acted differently on every ball in the box. 


tNote that we can’t use the y-matrices here because A, has no spinor indices. 


224 An Introduction to Particle Physics and the Standard Model 

which forces a 4- b — 0. Scaling out the constant a gives 
0,0" A, — 09,0" A, — 0 (12.20) 

which may be rewritten as 
O"F,,-—0 where F,, = 0,A, — 0,A, (12.21) 
The quantity F,,, is obviously gauge invariant: 
Fl, = ð, Al, — 0,A', = 0, (4. = Da) um (^ = x) 

= 0,AÀ, — 0, A, = Fuv (12.22) 


It also obeys 
Ox Fus + Ou Fix + OF yy = 0 (12.23) 


Now notice if we write 


ðo A _ OA; 04; 
= Ej and Fiz = Əxi x Dai 


Da? Ot 


Fio = Eijk Dk (12.24) 


then the previous two equations become 


OF, =0 «| "oM (12.25) 
-2E 4+V x B= 

OxFuv + Ou Fix + OP yy =0= Pus: (12.26) 
Bt +VxE=0 


which are eqs. (12.10,12.11) with p = J = 0: the source-free Maxwell- 
equations! 

We therefore interpret A, = (t, A) to be the electromagnetic 4-vector po- 
tential, with A the usual vector potential and y = Ao the Coulomb potential. 
Since A,, has a single 4-vector index it transforms like a spin-1 wavefunction 
under Lorentz-transformations: A, =- AL A,. The quantity Fu» is the field 
strength of this potential (sometimes referred to as the Faraday tensor) since 
it encodes both the electric and magnetic fields that are determined from Aj. 

Based on Dirac’s idea above, we now see how to couple A, to v. We write 


OF, =I (12.27) 


and this will modify the first two Maxwell equations as already noted. The 
current J, can actually be anything provided it is conserved since 


OI, = 0"0^F,, =0 (12.28) 
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where the latter equality holds because F,, = —P,, and 00% = ð — 


— Oz, Ot, 
Do. a = gea However, we want to couple A, to Y and vice versa, so we 
set J” = —ewy"w and write the following closed system of equations: 


o! F, p= —eb 7, Y 
(^D, m)o 0 (12.29) 


which are called the Mazwell-Dirac Equations. 

The Maxwell-Dirac equations, when quantized, are the theory of Quantum 
Electrodynamics, or QED. Formulated by Tomonaga [108], Feynman [103], 
and Schwinger [109], it was the first quantum field theory constructed, and 
the one with the most spectacular agreement between theory and experiment 
of any non-gravitational theory known — up to parts in a trillion! 


12.4.1 Physical Features of the Maxwell-Dirac Equations 


'The Maxwell-Dirac equations have a number of important features that I shall 
summarize here. 


1. All non-gravitational interactions are founded on the principles we used 
to construct QED, namely Lorentz-covariance and local gauge invari- 
ance. For QED the local gauge invariance is just a local phase invari- 
ance. For the weak and strong interactions there is also a local gauge 
invariance that has a conceptually similar character but becomes more 
complicated in detail, as we'll see in subsequent chapters. 


2. The gauge principle — plus the simplicity of minimal coupling (the fewest 
number of derviatives and terms possible in each equation) gave us a 
set of four wave equations for the spin-l wavefunction A,. We know 
these equations as Maxwell’s equations. As a consequence of the gauge 
principle the A, wavefunction (or gauge field as it is more commonly 
called) is massless. A mass term m would modify eq. (12.27) to be 


OF, +A, =I 


but this would not be invariant under the gauge transformation (12.14), 
and so we must set the mass to m = 0. This is a general consequence 
of gauge invariance — the gauge fields are massless. In QED we call the 
particle whose wavefunction is A, the photon. 


3. The group of gauge transformations depends on one parameter — the 
phase function a — and so the symmetry group of the theory is U(1). 
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4. The local gauge invariance implies that the charge Q = —e f dx yy 
is conserved?. 


5. Gauge-invariant theories are renormalizable: they have predictive power. 


12.5 The Wavefunction of the Photon 


We can solve the source-free Maxwell equations for A,, quite easily. Suppose 
we have a solution A, for the source-free Maxwell equations. Then A, + 0,A 
also solves this system, and we can use the gauge freedom in the function A 
to impose a constraint on Aj: 


8" A, =0 (12.30) 


a constraint known as the Lorentz condition. In this case Maxwell’s equations 
become 
0,0" A, — 0 (12.31) 


which is like a set of four massless Klein-Gordon equations — one for each 
component of A,,. Hence the solution is 


A,(x) = age,(p)e ?** (12.32) 
where ao is a constant and the Maxwell equations and the Lorentz condition 


imply 
p?=0 and e-p=0 (12.33) 


$ This can be shown as follows. 
d M" 3 9 (— 0 
S9 7 cef $e Z (51%) 
+e bad : (174) 


" j 48. (vsu) 


=0 


where the 3rd line follows from Gauss’ theorem and the last line follows by taking the 
surface intergral to be at large distance where y vanishes. 

“Two years after Maxwell the Danish physicist Ludvig Lorenz [119] also formulated the 
basic equations for electromagnetism, and reached the same conclusions Maxwell did about 
the relationships between light, charge, and current. He also formulated eq. (12.30), so we 
really should call this the Lorenz condition. However, more than 40 years later the Dutch 
physicist H. A. Lorentz wrote extensive encyclopedia articles and a book on electromag- 
netism [120] establishing him as an authority in classical electrodynamics and leading the 
community to refer to eq. (12.30) as the Lorentz condition. So don't feel too guilty if you 
accidentally drop the ‘t’ every now and then! 
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The quantity ¢,,(p) characterizes the spin of the photon. It looks like it has 
three independent components, since e - p = 0 is only one condition. However, 
this is not right: the Lorentz condition (12.30) does not allow us to uniquely 
specify A, because we could perform a further gauge transformation with 
functions Ç that obey 0"O,G = 0. To eliminate this last bit of indeterminacy 
we can impose 


Ay=0 V-A=0 (12.34) 
a set of constraints known as the Coulomb gauge, in which case 
e” =(0,€) where ê-ê=1 and é-p=0 (12.35) 


and so €,(p) actually has only two independent components. These corre- 
spond to the two possible polarization states — or spin states — of the photon. 
For a photon moving in the 2-direction, we can write 


cls) EN (0, 1, 0, 0) chy) € (0, 0, 1, 0) (12.36) 


and any photon wavefunction will consist of some complex linear combinations 
of these two polarizations. For example a circularly polarized photon would 
have 


1 1 
g^ C) = —_(0,1, 1,0 et) = —(0,1,-i,0 12.37 
Jg 0 1,4,0) Jg 01. -4,0) (12.37) 
and these correspond to the two spin states s; = +1 of the photon. 


So instead of four independent components for the photon wavefunction 
A, we have only two. But a massive spin-s particle has 2s + 1 spin states 
as we saw in Chapter 5. Since the photon has spin s — 1, shouldn't it have 
2s + 1 = 3 independent spin states instead of 2? The reason it has only two 
independent spin states is because the photon is massless. This means that 
there is no rest frame for the photon (or any massless particle), which in turn 
means that is not possible for an observer traveling in the same direction as 
the particle to move faster than the particle and observe a reversal of the 
component of its spin along its direction of motion. Hence the component 
of spin along the direction of motion for any massless particle must be fixed, 
which means that it must be either aligned or antialigned with its direction of 
motion. Hence a massless spin-s particle has only two spin states regardless 
of the value of s (unless s — 0 in which case it has only one spin state). 

We are now in a position to be able to write down the Feynman rules for 
QED, the subject of the next chapter. 


12.6 Questions 


1. Apply the local gauge invariance principle to the Schroedinger equation. 
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Find the Maxwell-Klein-Gordon equations, applying the same procedure 
that was used to find the Maxwell-Dirac equations. Note that you will 
have to consider a complex scalar field y in order to make this work. 


Show that the current 


ie [p* (Due) — (Dj 9)” e] 
is conserved for solutions to the Maxwell-Klein-Gordon equations. 
(a) Find the charge density p and current density J from the current in 
question #3. 
(b) Consider a gauge transformation that makes y purely a real scalar. 
How does your answer to part (a) change? 
Write the equation 

(ig D, — m) v =0 


in Hamiltonian form by isolating the time-derivative of the spinor v 
on the left-hand side of the equation. The Hamiltonian H will be the 
operator on the right-hand side of the equation. What is H? 


(a) For a wavefunction V with charge e show that 


[D,, D,] V = ieF,, V 


(b) Use the preceding relation to show that F,» is gauge-invariant. This 
provides another way to obtain the field strength of the electromagnetic 
potential. 


(a) Consider a modification to the Maxwell-Dirac equation 

(iy" D, — m) v = gev 
where o is a scalar wavefunction and g is a constant. How must o 
transform if this equation is to remain gauge-covariant? 
(b) Find the corresponding equation obeyed by v that is gauge-covariant. 
Consider a theory of one complex scalar particle with wavefunction q 


and two spin- 3 particles Y and x, each of which couples to the photon 
via the equations 


(iy^ D, — my) v = gex 

(iy^ D, — my) v = ge*v 

DD p + mg = bx 
where ma; 
particles. 


my, and my are the respective masses of the o, Y and x 
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(a) Find the most general local phase transformation of p, Y and x 
that leaves this system gauge-covariant. 


(b) What is the relationship between the charges of o, v» and x ? 


(c) In Maxwell's equations 
Of Fw =J, 


what is the current J, for this theory? 
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DOI: 10.1201/9781420083002-13 


Quantum Electrodynamics — or QED - is the quantum theory of electromag- 
netism that is founded on the Maxwell-Dirac Equations: 


O^ Ens = Iv Dy, = 0, + ieA, 
(i9 D, —m)p =0 where 8.500450, 4; (13.1) 
The current J, can be anything that obeys the conservation law 0” J, = 0. 
For fermions it has the form 


Jv = evi ybi + epay +--+ 8 (13.2) 


where ej is the charge of the “s-th” particle ( e.g. —e for electrons, +38 for 
up-quarks, etc.), and w; is its wavefunction. Each 43 obeys its own Dirac 
equation 

(iy” (O, +iesAy) — m3) vj =0 (13.3) 


where the repeated index “J” is NOT summed over, since it denotes which kind 
of particle we are considering. The ext part is called an external current; 
we include it if we want to consider particles moving in some background 
electromagnetic field*. 

QED is one of three parts of the Standard Model of Particle Physics (usually 
referred to as the “Standard Model”), the other two parts being the theories 
of the strong and weak interactions. The basic structure and foundations of 
QED [121] are incorporated into the other parts of the Standard Model, and 
so what we learn from QED will be useful when it comes to learning about 
the other parts of the Standard Model. We will also see beginning in Chapter 
23 that QED is actually subsumed into a more comprehensive theory called 
Electroweak theory, and in this sense the Standard Model has only two parts. 
For now we will treat QED separately. 

Table 13.1 recapitulates the solutions to these equations that we obtained in 
previous chapters (or that, as for the completeness relations, you can deduce 
from calculation as per the questions at the end of this chapter). 


*A background field, by definition, is assumed to be purely classical. Of course as far as 
we know quantum mechanics is pervasive and there is no physical system that is purely 
classical. So what we mean by a background quantity is one whose quantum effects are 
negligible relative to its classical ones. For example, if we act on an electron using a large 
magnet, we can generally treat the magnet and its magnetic field using classical physics, in 
which case we say that the magnet generates a background magnetic field. 


231 


An Introduction to Particle Physics and the Standard Model 


232 


qe "9 = (dm? CTS 
n A Cy? 

I^ = Otro? = (on 
0=4: 


0=z4 
i dr uod 


car (d) (to = (2) 


(+ — Y) suoyoug 


(a) oe ON je: 
8) 40) ya 


(112 2) suoruajyuy 


(a) mu OY Pac 
me 


¿U= ud 


0 = (4) yn (w — d) 
zd; - 9 (d) yn = (2) 


(7 ‘l= 2?) suoruuoq 


Ssouojgo[duio/) 


Ky [euoSoq1() 
UOL}VZITVULION, 
O) 
jurofpy 
Sjure1jsuoo 
umjuoeurom /urdg 
uorpungoAeA 
a[o1y1ed-9914 


suoryenbg IA pue [[oMxe]A 913 09 suorngog APE J-A 


Tet WISVIL 


Quantum Electrodynamics 233 
We can test QED by determining its predictions. This is done in two ways. 


1. Bound States: Electromagnetism is the force that binds atoms together. 
By considering the Maxwell-Dirac equations for bound systems we can 
search for phenomena peculiar to QED. We’ll look at this in Chapter 
14. 


2. Scattering: As with the toy theory, we wish to compute the matrix 
element M 


sum of all relevant 


—iM = (pj *** Pin M| pı +++ Pn) = (13.4) 


~~ Feynman Diagrams 


=af(p',p) +079(p',p) ++: 


where o is the fine-structure constant that we encountered in Chapter 
6, experimentally given by 
O e? _ 1 1 
^7 Amhe 137.035989 — 137 


(13.5) 


It is a dimensionless number that characterizes the strength of the elec- 
tromagnetic coupling. Since it is small, we compute all matrix elements 
as a perturbation series, using a as the expansion parameter. 


Pi Sı 


P3,€3 


FIGURE 13.1 
A general QED diagram. 
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13.1 Feynman Rules for QED 


Now let’s write down the Feynman rules for QED. For comparison I will 
include the rules for scalars where appropriate. 


1. NOTATION. Label the incoming (outgoing) four-momenta as p1, p2,.--,Dn 


(p,,pb,--. Pin) ; the incoming (outgoing) spins as 51, $9, .. , Sn (51; 52, pny Sea) 
the incoming (outgoing) photon polarizations as &1,65,... (ef, ef',...), and 
label the internal four-momenta qi, q2,...,q;. Assign arrows to the lines as 


shown in fig. 13.1. 

Note that in fig. 13.1 time flows from bottom to top, wiggly lines are 
photons, and lines with arrows are fermions (if they point upward with time) 
or antifermions (if they point downward against time). 


2. EXTERNAL LINES. Each external line contributes a factor as follows 
where I’ve included scalars for comparison. Note that the factors associated 


incoming Pa =] incoming uu, (p) 
scalars l photons 4 
outgoing” =1 


" " 
= outgoing " 
¡er (p) 


incoming SÉ anti incoming / — v(p,s) 
fermions : 
outgoing ^ fermions outgoing A” 
alpi tpi 


FIGURE 13.2 
External lines in QED. 


with external lines correspond to the incoming/outgoing plane-wave states; 
this is consistent with our scattering assumption that all particles are free 
particles in the past/future asymptotic limits. For the scalars in the toy 
theory this rule was not required because the factor was unity. 


3. INTERNAL LINES. Each internal line contributes a factor shown in figure 
13.3, where m is the mass of the particle. As before q? 4 m? because the 
particle flowing through the line is virtual (i.e. it does not obey its equations 
of motion). These internal lines are called propagators. 

'The next two rules are the same as for ABB theory. 
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scalar = 


q -m q 
fermion/ i(y-q+m) - 
antifermion 2 2 = » 
q m q 
i 
photon = qd Suv = py Y 


FIGURE 13.3 
Internal lines in QCD. 


4. CONSERVATION OF ENERGY AND MOMENTUM For each vertex, 
write a delta function of the form 


EAT (ky + ka + ka + +++ + kw) 


where the k’s are the four-momenta coming into the vertex (i.e. each k^ will 
be either a q^ or a p"). If the momentum leads outward, then k^ is minus the 
four-momentum of that line). This factor imposes conservation of energy and 
momentum at each vertex (and hence for the diagram as a whole) because 
the delta function vanishes unless the sum of the incoming momenta equals 
the sum of the outgoing momenta. 


5. INTEGRATE OVER INTERNAL MOMENTA For each internal momentum 
q, insert a factor 


(27)? 


and integrate. 


6. VERTEX FACTOR This is the rule that characterizes QED. For every 
interaction between charged particles and photons draw a point with three 
lines coming out signifying one photon and two fermions as shown in figure 
13.4. and insert a factor of —ige(Y”)ab, where a is the spinor index of the 
fermion pointing away from the vertex (the “barred” fermion) and b that 
of the fermion pointing toward the vertex (the “unbarred” one), and where 
Je = e = V4rhca is the dimensionless coupling for electrons/positrons. In 
general it is the magnitude of the fermion charge entering/leaving the vertex 
in units of the positron charge. 


7. TOPOLOGY To get all contributions for a given process, draw diagrams 
by joining up all internal vertex points either to the external lines or to each 
other by internal lines in all possible arrangments that are topologically in- 
equivalent, consistent with rule 6. The number of ways a given diagram can 
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FIGURE 13.4 
The QED vertex, where £” is the photon polarization. 


be drawn is the topological weight of the diagram. The result is equal to 
—i M. 


The next two rules apply to all theories that have fermions. 


8. ANTISYMMETRIZATION Because fermion wavefunctions anticommute, 
we must include a minus sign between diagrams that differ 
(a) only in the interchange of two incoming (or outgoing) fermions/anti- 
fermions of the same kind 
or 
(b) only in the interchange of an incoming fermion with an outgoing 
antifermion of the same kind (or vice versa). 


9. LOOPS Every fermion loop gets a factor of (—1). 


10. CANCEL THE DELTA FUNCTION The result will include a factor 


EMT (p, + ph +++ + pi, — Pi — 92 — +++ — Pn) 


corresponding to overall energy-momentum conservation. Cancel this factor, 
and what remains is —¿M. 


13.2 Examples 


The best way to learn the rules is to use them. We will consider here a 
variety of physical processes to illustrate how the rules are used in QED. 
Unless otherwise stated, all computations will be to lowest order in a. 


13.2.1 Electron-Muon Scattering 


Let's begin by looking at the simplest scattering process that occurs in QED 
— namely the scattering of one charged particle from another of a different 
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kind. Electron-muon scattering is a realistic physical example: 


e +u —e +p 


To lowest order in the coupling, there is only one diagram since the particles 
are all distinct’. Applying the rules we get 


" d* i4 > i —i v 
—iM - [E [s Pat) (ger) uo) ( 2n ) 
(27) A a L a q 
—— ———— 
rulee5  rule2 rule6  rule2 rule 3 
: RRA (-igk") vios) 
—— M ————— 


rule2 rule 6 rule 2 


x (2m) *6 (ph — q — p2) (277)*6 (p, + q — p1) (13.6) 
———M—— 
rule 4 


It's easiest to see this by taking an outgoing fermion line (e.g. the row-matrix 


FIGURE 13.5 
Electron-muon Scattering Diagram. The muon is drawn with a thicker line 
to distinguish it from the electron. 


a) (pt )), following it backward to attach it to the vertex (e.g., (—igey") ay: 
where the index y is the same as that of the “internal polarization" e” of 
the virtual photon), and then following it back to the incoming line (e.g. the 
column-matrix uf) (p1)). The result is the first term in the above expression?. 


tNote that we will get the same kind of answer for any two distinct charged fermions 
scattering off of each other (e.g., electron-quark, muon-tau, etc). All that will change are 
what we put in for the charges and the masses. 

ilve put this expression in square brackets to emphasize that this is the product [(row- 
matrix) x (square-matrix) x (column-matrix)]. 
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Doing this for the muon line gives the second term, where we must attach a 
different polarization e” to the photon at that vertex. The indices on the 
internal photon line are chosen so as to join these together with a metric, 
whose indices are the same as the two y's. 

Doing the integral in (13.6) is easy, since it is just an integral over delta- 
functions, just like what we did in eq. (10.9); carrying out the same procedure 
and simplifying a bit gives 

» 
-iM = [i (p) ue p] — 5 [uc pyu p| — a3 
(p, — pı) 
Notice the structure of M in eq. (13.7): it is of the form 
(current)" x (propagator) x (current), 


This is the general form of all fermion-fermion interactions. 

To proceed further we would have to put in the initial conditions for the 
electron spin-state uU?) (p4) and the muon spin-state u(2)(p2), as well as the 
final states for each. The resultant matrix element M would then be squared 
and put into a cross-section formula to yield a predicted scattering rate. This 
is very tedious and we won't do this here — we will see later that there is an 
alternative method that avoids resorting to such initial and final conditions. 


13.2.2 Bhabha Scattering (electron-positron scattering) 


'The next simplest process is that of the scattering of a particle off of its own 
antiparticle. Let's consider electrons scattering off of positrons, since these 
kinds of beams can be easily produced in a laboratory setting: 


e tet — e tet 


To lowest order in ge we have two diagrams, shown in fig. 13.6 where the one 
on the right is similar to the one we computed in eq. (13.6): 


Might = 2 [uo (i) (—ige") up.) (=) 
x [B (pa) (~igo) v9 (ph) 


x (2m)*6 (pj — q — pa) (27)% (pi +q- pı) — (13.8) 


except that we have positron-wavefunctions v!) (p) instead of the muon wave- 
functions’. 


SThe easiest way to construct the [oo (pa) (—tgey”) v) (v5) expression for the positron 


is to follow the arrow backwards from bottom to top. The incoming antifermion yields a 
factor of (o 62) (p2)), ; the vertex yields the term (—igey”),,; and the outgoing antifermion 


yields a factor of (v6 (22). yielding the product (262 (pa), (-igeY” oa (v6 (92)) 


Summing over the repeated (a, b) indices gives [52 (pa) (=igey”) v6 (pf) 
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FIGURE 13.6 
Bhabha scattering to lowest order. 


The diagram on the left represents annihilation of the electron and positron 
into a single virtual photon which then produces an electron-positron pair. 
Here the rules give 


d*q 


AM of = I en: [uo (y) (—igey") v2 (pl, )] 2 
« Pen ig] 
x (2748 (p, + p3 — q) (27)*5 (a — p — p2) (13.9) 


Now we have to apply rule 8, which tells us whether or not one diagram gets a 
minus sign (i.e. whether or not we add or subtract these two diagrams). This 
means that we need to see if these diagrams are the same or not if we switch, 
say, the incoming electron with the outgoing positron. From figure 13.7 it's 
easy to see that they are, and so rule 8 says that these diagrams subtract. 


P p' 
P, D Switch P' p d 
Y incoming N, 
IWAVAVAY, electron = 
Cw. | with DN 
Pi outgoing Pr 
pi positron Pi 
Pi 
P3 


FIGURE 13.7 
Equivalence of the two diagrams under particle interchange. 


240 An Introduction to Particle Physics and the Standard Model 


Hence we obtain 
iM = Mort — (-iMignt) 
;52 
; zx is 1e 
= (—ige)? [u (piv (95)] 


(pi + p2) 
2 


z [v6 (paju (pi). 


ie 


- (~ige)? [EA tu (o) [pro (04) 13.10) 


/ 


2 
(pj = pi) 
for the matrix element for this process. 


13.2.3 Compton Scattering 
This process involves a photon scattering off of an electron: 
e ty—e€ +y 


Again we have two diagrams to lowest order as shown in figure 13.8 and now 


P, , p 
Pa Pı € 
p E 
+ un 
> 5 
n, l 2 
Pi P 
Di Po 
FIGURE 13.8 


Compton scattering to lowest order. 


we add them since fermion interchange is irrelevant. Applying the rules now 
gives 


4 / u(y: m . 
My = 00 | cs ETT ige'y") Qr À( 1909) Xp) 


gq? — m? 
xe, (pa)(2n)46 (p, 4- p5 — q) (27)*6 (q — pi — pa) (13.11) 
and 
x dq [a ' i(y:q--m os capi 
¡Mega = e) f hs 6994) (ioc LE ig foo] 


xeu (pa) (2)*0 (po + q — pı) (21)* (pi — a — p2) (13.12) 


Note that the only differences have to do with where the polarizations contract 
and with how the delta functions conserve momenta. 
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13.3 Obtaining Cross-Sections 


In order to finish any of these calculations we would have to specify all of the 
initial and final spins in the problem, insert the corresponding free-particle 
solutions into the expression for M and then integrate over the solid angle to 
get the cross-section. This is enormously tedious and fortunately unnecessary 
— as we discussed in Chapter 9, most particle beams are unpolarized, with 
spins randomly distributed along any axis, and most particle detectors count 
particles regardless of spin. We therefore apply the spin-summing/averaging 
trick of eq. (9.36): 


par = (1) Y. x mP (13.13) 


iiam ST 


where the factor of ‘ey arises because we have two initial spins to average 
over. The expressions then simplify due to the following identities 


UT) = (P+), D PO P (pm), (13-14) 
i2 i2 
which you can prove yourself. These identities are used in what are called the 


"Casimir tricks" [122]: 


AS [690r gue?) 


adn 
= Tr [Ty (p, + ma) Ty (gg ms) — 1345) 


Y) PWET pn] ware pn) 


ia iB=Î,} 
= Tr [Ty (p, - ma) Pu (~,-me)] (1316) 
Y [s oA) qué pg] [v 9 (pa) gru 6? (pg) 
ia,ip=TL 


= Tr [Ty (p, - ma) Ty (pp + mu)] — 0347) 


for any two 4 x 4 matrices | and l'qj, where Ty = 4? Thy. I'll relegate the 
proof of these relations to an Appendix. For now, let’s use them to finish the 
calculations. 

Let's consider electron-muon scattering. First we need | M[": 


IM}? = MİM 
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4 


x (Pe eano] fa asa) (13.18) 


Note that I’ve done two things here. In the first line, I relabeled the Greek 
index p (in the M part) to v (in the M part) so I won't get confused between 
the two. Then, in going from the 2nd line to the 3rd line, I regrouped the 
square-bracketed pieces with common factors of momenta so that I can use 
the Casimir trick. For this particular case, the 1 and Py] matrices are going 
to be given by y* and y”. The trick implies that 


27 fat (py) (pi) [2 (pL ou (pi) 


1 1 =9,1 


= Tr [7 (p, +m) 7 (p, +) | (13.19) 


i2,i5=7,1 


= Tr |7, (p, +M) v ($, + M)] (13.20) 
where m is the electron mass and M is the muon mass. So we get 
2 
mar=(5) x X wf 
iniz=1,1 14 4=9,1 
AE ais Ns 
la) "Dr 9 0 9] 
xT [5, (p, + M) v (p, + M)] (13.21) 
and so we see that spin-summing/averaging has turned the calculation into a 


problem of computing products of traces over ^-matrices! 
How do we do these? The first thing to note is that 


qu = 9 (qut 49 = qu (13.22) 
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which you can show by brute force from the definition of the y-matrices. The 
next step is to note that, since the y-matrices are 4 x 4 matrices, we get the 
following two simple identities 


Tr[/]] 24 and Tr 5"4"] = 4g"" (13.23) 


proved in problem 2 from Chapter 11, along with other results for traces and 
products. These are listed in section 13.4.2 of the appendix for convenience. 
Using these results we find 


T [^ (8, +m) 1 (p, +m)] 

=Tr KALA +m? Tr [yy] 

= Tr rnv s] + mr [7] 

= Tr [Y (913^) Y (01,7)] + 4n? gt” 

= piap Tr [a y 7] + 4m? gt” 

= Api pin (g^g"? — gh” g^" + gg) + 4m? gh 

=4 (pi py + pr py +91” [m — pi - p1)) 

= AL" (p, pi; m?) (13.24) 
where note that in going from the 3rd to the 4th line the p's come out of the 


trace because they are not matrices. In going from the 4th to the 5th line I 
used the relation 


Tr [qao ert peat (g> g" = g g% + qeu) (13.25) 
listed in the appendix. Similarly 
Tr 7, (y, T M) Ww (5, + M)| =4 (PouP2v + Poy Pon + 9uv [M? — Po : p2]) 
= AL, (po. p2; M?) (13.26) 
So the matrix element is 
2 


2 

——9 16 e 

[M] SANG ge A LH” (p, py; m?) Ly (po, po; M?) 
(p, — pi) 


2 
2 
=4 (a) (pipi + pr pl + gt” [m — p -p1]) 
1% "Pi 
X (PouP2v + PoyP2u + Juv |M? — p3 - p2] ) 


=4 (a) [2 (p1 - pa) (P1 + P2) + 2 (pi : pa) (m: pa) 
(pj — pı) 
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+2 (p3 - pa) [m? — py pi] 205: p1) [M? — po - p2] 
+4 [m? — pj - pı] [M? — py - pa]] 
e? a 
=8 (=>) [(p1 - p2) (Pı + p2) + (p1 > P2) (i ` P2) 
1 
— (py: p1) M? — (py + pz) m? + 2m? M7] (13.27) 


which is a Lorentz-invariant scalar! 


,— ® O 
l P» 
N 


FIGURE 13.9 


We could now evaluate this quantity in some frame of reference and compute 
a cross-section from it! For example, in the lab frame, with the muon initially 
at rest as illustrated in figure 13.9: 


pi =(E,p)  p=(M0)  py-(E.P) (13.28) 
and we need to compute all of the “dot products” in the expression. These 
are: 

Pi:p2— EM pj:pog — EM py: pi = EE’ — |p||p '|cos0 
mp, = Pı ` (pit P2 — P1) = m? + EM — (EE' — |p] |p"| cos 0) 
pp, = Pi (pit pa — p) = (EE' — |p| |P "| cos 9) + E'M — m? 
po- p2 = p2- (pı + pa — gj) = EM + M? — E'M (13.29) 
(p, — pi)” = m? +m? — 2 (EE' — |p| [i "| cos 6) 


and so 
(p1 + P2) (px ` p2) + (vi > P2) (n: P2) — (P1 + p1) M? — (p3 + pa) m^ + 2m? M? 
= [(EE' — |p| |P '| cos 0) + E'M — m?] EM 
+E'M [m? + EM — (EE' — |p| |p'|cos0)| — [EE' — |p| |B *| cos 6] M? 
-[EM + M? - I'M] m? + 2m? M? 
= M (E' — E + M) |p| |p| cos0 + M? (EE' +m’) 
+M (EE' — 2m?) (E — E?) (13.30) 
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giving 


© OA [m — (ER — [il "| cos)" 
x [M (E' — E+ M) |p| | '| cos 0 
+M? (EE' +m?) + M(EE —2m?) (E — E')] (13.31) 


my 8e4 


which can be put into the lab-frame cross-section and evaluated: 


2 3121272 
10 dee - Mp! (13.32) 
dQ) yan 87) M |p| llp*| (E + Mc?) — |p| E" cos 6] 


The preceding expression is quite cumbersome, but we can understand its 
physics using a simple approximation. Suppose that the muon is so heavy 
that we can neglect its recoil. This implies that M >> E, E' and |p| œ |P], 


EcE x~ Vif? + me so eq. (13.31) reduces in this limit to 
8etM? li? (1 + cos 0) + 2m? | e^ M? lat? cos? f +m? 
i 4 |p1* [1 — cos 0]? B 


[p - (13.33) 
Il sin? 2 


and eq.(13.32) becomes (putting into M the correct factors of c and remem- 
bering that e = v4ra ) 


(z) = ( h jum [Lo cos? $ + (mo)”] 


dO 87M 51! sin‘ 2 
2 
ha 2. 920 | 
= Ie aye a cos! = + 13.34 
(sse) a cos? + one?) — 1339) 


which is called the Mott formula [123]. 

'The Mott formula is a good approximation for electron-muon scattering or 
for that matter electron-proton scattering if we take M to be the proton mass. 
Note that if the incident electron is non-relativistic then |p| ~ m |v| << (mc) 
and eq. (13.34) reduces to 


2 
do hea do 
I ~ | ——~_ ] = [| — 13.35 
(i ) Mott | 2mv? sin? g ) ( da ) Rutherford ) 


which is the original Rutherford scattering formula [18], first developed by 
Ernest Rutherford to explain the data from his experiments scattering alpha 
particles off of a gold foil. Mott’s formula was developed by Neville Mott to 
describe the analogous process when alpha particles are replaced by electrons 
which, being much lighter, have relativistic momenta necessitating use of the 
more detailed formula (13.34). 
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13.4 Appendix: Mathematical Tools for QED 
13.4.1 The Casimir Trick 


It is very common to encounter terms of the form [u 4) (p4)Pu 9) (pg)] when 
evaluating Feynman diagrams, where T will be some product of matrices (typ- 
ically composed of y matrices). The Casimir trick helps us deal with these 
quantities when we employ spin-averaging and spin-summing to our expres- 
sions. 

Consider the following expression: 


tora 
ESA - ull) pra (4). 

= [ut (p) Thyul"(p4)| 
= [i (pB) Thu (4) 


= [2 wa Tu (pa)| (13.36) 


The second line follows because (7°)? = I and because u! = ((u') y = lu 
(since (y)! =°). The last line follows from the definition Py = PE. 
Now consider 


[ap JC puta (pg). [n9 (wal pu (pp) 
= Y) Pr] ra | 


a a") (pg) (Ty) a, uf ^ (pau (pa) (Cir) oa u$ (pg) (13.37) 


where in the last line I have explicitly written out all of the Dirac indices that 
are summed over, so that for example P¡qu('» (pg) = I - (Ein); utm (pp) — 
(Ti) i ule ) (pp) using the summation convention. We can now sum over the 
spin-indices: 


Y. ou (pg) (Ty), uj (pat (pa) Cir) a ud (pa) 


iA,ip—T.l 

= Y aps) Tp), | X ui? (paul (pa) | (Cy) aut (PB) 
ia=T,L iB=f,} 

= YI Pps) Pa, (p, + ma), Cr eat” (oa) (13.38) 


ia=T.l 
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where the last line follows from the identity 24s u (pya® (p) = (p + m) m 


Now I can do the other spin-sum by moving the components of mus pg) all 
the way to the left (since they're just functions, this is allowed) 


27 a”) (pg) (Tr) a (p, + ma) » (Tr) ul”? (pg) 


¡a=7,1 


y ul®) (pp yat») (pg) (Ty) os (pa ate ma) [Eme 


¡a=7,1 
= (ts 2 mp) (Tr) a (pa F ma) (Ur) (13.39) 
da bc 


using the same identities as before. Note now that the last line is just the 


trace of the product of the 4 matrices (ps + mp) Dg (pa +ma), and Tig- 


Hence we find 


Y, [ap ui (o) [n6 pa) qué (pp) 


ia,ip=T,l 


= (ps t mg). (Tr), (pa * ma) (rr). 
= Tr | (pp + mp) Ty (p, ma) Py] 
= Tr [Ty (p, ma)Tg (pg ma)| — 1340) 


where the last line follows from trace cyclicity (Ir(. AB) =Tr(BA)). This is 
the Casimir trick. By similar reasoning (try it!) you can show 


D PUPO p| [9 (armo) 


ia iB=Î,} 
=T [Ty (p, ma) Tu (pp, ma)] 0341) 


D POPU p] fo par (pg) 


ia,ip=T,l 


= Tr |T (p, - ma) Ty (py ma)| (1342) 


13.4.2 Dirac y—Matrices and Their Traces 


0. . I 0 dirus 0 c! 


and obey the relation 


The y—matrices are 


tr, y) = 2g” (13.44) 
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We define some auxiliary matrices via the relations 


. V Ü V 
PERCE A (13.45) 
Oi i 0 c' il ot 0 


Note that we now must distinguish between raised and lowered indices: 4? = 


yo and y? =—y;. The following identities can be directly verified: 


(1°)! =Y Qn = 2 E Ua 
IL pop ew e 

VN IE a Va| = 24 
Ya A = Ag" Yapa“ = 4a -b 


PIPI = —2y yy" apb“ = -244p 


Using these relations, we get the following trace theorems 


Tr [Z] = Tr [odd # of y-matrices] = 0 

Tr [y^4"] = 4g"" Tr [dp] = 4a - b 

Tr ye a = 4 (ght ge? = git g’? + gh gee) 

Tr dped] =4(a-bc-d—a-cb-d+a-db-c) 

Tr vñ] =0 Tr [žy] = 0 
Tr y yr y Bl = 4il Tr MAP = 4iet”P a bycadg 


where &e""?? is the 4-dimensional Levi-Civita symbol (or epsilon-tensor) which 
obeys 


¿0123 = a] £0123 = +1 
uvaB _ J —1 (if uvaß is an even permutation of 0123) (13.47) 
" ~ | 44 (if yvaf is an odd permutation of 0123) í 
_ [4 (if uva is an even permutation of 0123) 
QURE S \ 1 (if uva B is an odd permutation of 0123) 
Using this it is possible to prove the following 
pe Tapes — 5% 5050 — 620267, — JR OX OS 
v eB sa asu Qa sv 
+67 552 + 006207, + 09 020% (13.48) 
EBay = —2 (8200 — 626°) (13.49) 
PrP e sax = 60% (13.50) 
gt Pe rag = —24 (13.51) 


as I will later demonstrate in the appendix of Chapter 20. 
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13.5 Questions 


1. Consider a theory in which the electron couples to a massless pseu- 
doscalar particle (the ¢) with the vertex 


= igy 


FIGURE 13.10 


(a) Draw the lowest-order Feynman diagrams for the process et --e^ — 
29. 

(b) Compute to lowest order the differential cross-section for the process 
in (a) in the CMS frame. 


2. For the theory in question #1 
(a) Draw the lowest-order Feynman diagrams for the process ¢ + —> 
et+e. 
(b) Compute to lowest order the differential cross-section for the process 
in (a) in the CMS frame. 

3. For the theory in question #1: 


(a) Draw the lowest-order Feynman diagrams for the process p+e” — 
ote” 

(b) Compute to lowest order the differential cross-section for the process 
in (a) in the lab frame. 


4. The lowest-order diagram for the process e” +47 —> e +p” is shown 
in fig. 13.11. Draw (but don’t calculate) and label all relevant diagrams 
to next-lowest order in the coupling constant. 


5. The lowest-order diagrams for the process et + e7 — et + e^ are 
shown in fig. 13.12. Draw (but don’t calculate) and label all relevant 
diagrams to next-lowest order in the coupling constant. 
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FIGURE 13.11 
Electron-muon Scattering Diagram. The muon is drawn with a thicker line. 


P» 


FIGURE 13.12 
Electron-positron Scattering Diagrams to lowest order. 


6. Consider the Dirac equation in (1+ 1) dimensions (1 spatial + 1 tem- 
poral): 
(i^, — m) =0 
where the index y = (0,1), 8, = (4, 2) and the 2x2 y-matrices are: 


10 0 1 
O. l.. 
vA) v-() 


(a) Compute (4",4"). What is the physical interpretation of this 
result? 


(b) Find the solutions to the Dirac equation in (1 + 1) dimensions. Phys- 
ically interpret your results. Do the particles have spin? Are there 
antiparticles? 


(c) Apply the principle of local gauge invariance to this equation and 
write down the results. 


(d) Show that the quantity J, = ew yw is conserved. What is the 
physical interpretation of the components? 

7. Find all diagrams of order e* for the processes below. Label them but 
do not calculate them. 
(a) et -y — e* tv 
(b) y+y— et «te 


14 
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In Chapter 13 we saw that the matrix element for electromagnetically scat- 
tering an electron (mass m, initial momentum p1) off of a muon (mass M, 
initial momentum p$) is, from figure 14.1 


FIGURE 14.1 
Electron-muon Scattering Diagram. The muon is drawn with a thicker line 
to distinguish it from the electron. 


e2 


M=i [up tue (ps) (pi — pi) 


5 [uw uu (| aD 


which after spin-summing/averaging boiled down to 


sz)? 8e*F(E, M, m,0) 
M — A[m? — (EE' — |p| [7 "| cos 6)]" 
with 
F(E, M,m,0) = [M (E' — E+ M) [pl lp "| cos 0 (14.2) 


+M? (EE' +m?) + M (EE' - 2m?) (E — E’)| 


in the lab-frame. This is an elastic scattering process (A+ B — A+ B), so 
we obtain in the Lab frame 


pi =(E, p) p!-(E,.p) p=0 Ez=M 
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(where the muon is at rest) the expression 


do (AY sm] Ie’? (14.3) 
dO 187) M |p| lp" (E+ Mc?) — |p| E' cos] 
where the statistical factor S = 1. 

The final energy E” of the muon is determined in terms of the initial condi- 
tions (E, p, and M) and the scattering angle 0, which is the only independent 
variable. We can see this by noting that momentum conservation gives 

p7 = (p2 + pı — p1)? = M? = M? + pi +p? +2 (p2 + (p — p4) — m: Pi) 

>0=m?+M(E-E’)—(EE' — |p||p '| cos) (14.4) 


Proceeding along the lines given in Chapter 2, we solve for E” 


(M + E) (m? + ME) + |p)? cos 04/ M? — m?(1 — cos) 


E' = 
(M + E)? — |p]? cos? 0 


(14.5) 


in terms of (E, p, and M). When the mass m of the electron can be neglected 
we obtain 


AA (M + E) (0 + ME) + |p]? cos 0V M? 

i (M + EY — |pl? cos? 6 
ME(M + E) + ME?cos0 
[((M + E) — E cos 0] [(M + E) + E cos 6] 

ME 


= E(1— cos0) + M (146) 


which is the result (2.39) that we found in Chapter 2! 


14.1 Basic Features of QED Scattering 


Putting all of this together — inserting eqs. (14.2) and (14.5) into eq. (14.3) to 
get the resultant differential cross-section — gives a very complicated looking 
formula! Despite this, we can extract some meaningful physical insight from 
it by looking at some key features it has in common with other 2-fermion 
QED scattering diagrams. 


14.1.1 Coupling 


Note that d 
DÀ o 
|M] «etna? 2 — xa? 


de (14.7) 
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This is because the amplitude has two vertices, each with a factor of e ~ 
Va. The next order correction goes like af, and higher orders contribute 
further corrections proportional to even powers of a. The power of o from 
any diagram is always a”/? where n is its number of vertices. 


14.1.2 Propagator 


'The denominator of the matrix element comes from the internal line, which 
gives a factor 

1 1 
(p =p)  2[m? -(EE'- |p| |P "| cos 0)]? 


1 m\2 (M\? 
“Eras I (5 | e 


1 
i 14.8 
8 (EE'Y sin’ 2 (148) 


at high energies, where|p| ~ E and |p "| ~ E”. If the matrix element had no 


other structure, then de x EI which is the salient term in the Rutherford 
2 


scattering formula (13.35). The exchange of a single virtual particle between 
two scattering particles will always produce a sin? g factor in the denominator 
at sufficiently high energy. 


14.1.3 Matrix element 


The numerator of the matrix element is 


F(E, M,m,0) = [M (E' — E + M) |p] |B "| cos 6 
+M? (EE' +m?) + M (EE' - 2m?) (E — E’)| 


0 
c 2M? EE’ cog? 5 (14.9) 
again at high energies, where|p| ~ E and |p '| ~ E”, indicating that in this 
limit do x cos? g. This feature is due to the spin of the fermions and does 


not occur in a scalar theory like ABB theory. The reason is due to helicity 
conservation. Suppose that at high energies, the incoming electron is moving 
along the z-axis with spin up (i.e., spin aligned with its direction of motion). 
It will be in the state 


E 1 
NAT (5) ; 
uD (p) = V2m EA c 
2m (p-0)€ 03 


| 
4M 
MS 
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whereas the outgoing electron — which is scattered at an angle 0 — is effectively 
in a frame that is rotated by an angle 0 relative to the z-axis. Setting, say, 
0 = 0$ , it has the form 


u (p') = 


O y ir oO 
A (cos 5 t 102 sin 2) 0 (Da 
=( 0 (cos £ + i» sin $) ap 00) 
cos 0/2 sin 0/2 0 0 
| —sin@/2 cos 0/2 0 01 75 
= 0 0  cos0/2 sin0/2 2 
0 0 — sin 0/2 cos 0/2 


(14.11) 


in the same high-energy limit. This gives 


0 0 0 0 
T (p)g9u(D (p) ~ V EE! c 5 sin 57008 5; sin 5) CUR 


ORO 


6 
= 2VEE’ cos 5 (14.12) 


and so Ju (pul (pl ~ EE’ cos? $, consistent with our formula above. 


14.1.4 Dimensionality 
At high energies, we have 
EE 1 
EE) EE 


do 


1 


2 do M2 E 
|t] A and zg x M] EU 
because c is like an area~(length)? ~ h?/(momentum)? ~ (hc)?/(energy)?. 
At high energies particle masses are negligible, and dimensionality forces the 
cross-section to behave like 1/E? ~ 1/(CMS energy)? since this is the only 
energy scale in the problem. This feature is shared by all 2-Body cross- 
sections in QED. 


14.1.5 Antiparticles 


We would get exactly the same answer for positron-muon scattering, or electron- 
antimuon scattering, or positron-antimuon scattering. This is because QED 
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is charge-conjugation invariant. 


14.2 Major Tests of QED 


'There are a number of different ways to test QED. The most important of these 
have to do with measurements of scattering processes, anomalous magnetic 
moments, atomic energy level shifts, and changes in the coupling strength. 
Let's look at a brief sketch of each. 


14.2.1 Scattering Processes 


There are various scattering processes of interest in QED, which are given 
in the diagrams in figures 14.2 and 14.3 These can be computed using the 


+ 
Pair annihilation 


<= + | » Pair creation 
+ 
Compton Scattering 


FIGURE 14.2 
Lowest order inelastic scattering processes in QED. 
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e-u scattering 


Electron-electron S 
scattering 
(Moller scattering) + 


Á 


Electron-positron 
scattering 
(Bhaba scattering) + 


FIGURE 14.3 
Lowest order elastic scattering processes in QED. 


diagrammatic formalism of Chapter 13, and the resultant cross-sections have 
been shown to agree with experiment to a high degree of precision. 


14.2.2 Anomalous Magnetic Moments 


All charged objects with angular momenta have magnetic moments, so we 
expect electrons (and muons, and taus) to also have magnetic moments. An 
object with a magnetic moment ji will (non-relativistically) have its interac- 
tion with a magnetic field B described by a term in the Hamiltonian that is 
proportional to the magnetic field. This is 


, " h = 
Hmag = —ji- B where ji = gupS = g 8 (14.14) 
2mc 


> 4l? 
where S is the spin operator with eigenvalues |5 | = s(s + 1) for a particle 
of spin-s, and ug = eh 5.78838263 x 107? eV T-! is the Bohr magneton 


2mc — 
in units of electron volts per Tesla. The quantity g is called the gyromagnetic 
ratio of the particle. It is the ratio of the magnetic dipole moment to the 
mechanical angular momentum of a system — a dimensionless number that 


indicates the strength of the magnetic moment in Bohr magnetons. 
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The Dirac equation predicts g = 2. We can see this by rewriting the 
Dirac equation so that it is in non-relativistic Hamiltonian form. Writing 


Ay = (o. A), we have 


o > E 
(n^ D, - mé = 0S iE = [Lej V eph — e Ay + my] 
(14.15) 
Anticipating taking the non-relativistic limit, let's write y = e "t ed and 


insert it into (14.15). This gives a pair of 2-component spinor equations 


Ó 3 

ix = edytéa- (5- eA) x (14.16) 

19) 2) 

D = (-2m + ed) x  8- (s eA) e (1417) 
where p — —iV. So far this is an exact result. Now let's take the non- 


relativistic limit, where we have 


m >> e, 


Ox 
x (14.18) 


and so the 2nd equation has the approximate solution 


m di s cA) 7 (14.19) 
> ive ~ epp + x le: (5- A) o (14.20) 


where eq. (14.20) is the Pauli equation (named after Wolfgang Pauli who 
first proposed the concept of electron spin [124]). It is an equation for a 
2-component spinor y that generalizes the non-relativistic Schroedinger equa- 
tion to include spin! 

Now we can use the identities (5.32) ojo; = dij + t6:;~n0% to write 


> 


ES (s-eA)| = |z- ell — es. B (14.21) 


the proof of which I have left as a problem. This gives 


12 
Op [|e eA € | 

~ 5-Blo=H 14.22 
UA | 3m ep — 5a ie p ( ) 


'The first two terms correspond to the Hamiltonian for a particle of charge 
e moving in an electromagnetic field with vector potential A and Coulomb 
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potential $. Since S= g for a spin- 4 particle, the last term on the right- 
hand-side suggests that we identify 


Hmag = —L—8: B = -2— (2.8) md (8. B) (14.23) 
mc 

(putting in the correct factors of A and c) which implies that g = 2 for a Dirac 

particle of charge e, as promised". 

Here then is a clear prediction from the Dirac equation, one that can be 
easily compared to experiment by putting an electron (or muon or tau) in a 
magnetic field and checking the value of the coupling. The g-values of the 
e and y” leptons have indeed been measured — but they disagree with the 
value g = 2 by ~ 0.2%! It’s a tiny discrepancy — but one that is not within 
limits of error. 

'Thus the Dirac picture of a pointlike e^ and p^ is not exact. Why? 

'The actual reason has to do with quantum field theory: the quantum for- 
malism in which the wavefunctions of the e” or y” and photons are themselves 
quantized so that they can create or destroy particles. This feature allows the 
e” (or 7) to continually emit and reabsorb its own photons. From the dia- 
grams in figure 14.4 we see that the g-factor should be corrected. The actual 
computation of these diagrams is quite tedious — I’ve put in the answers that 
come from computing them. The diagram at the bottom comes from adding 
up the two diagrams in the middle row of 14.4. Upon comparison with Hmag 
above, we see that 

g=2 (1 de 2) (14.24) 
277 
and so the magnetic moment receives a correction of order o with a coefficient 
of x, responsible for the 0.2% correction noted above and first obtained by 
Schwinger [125]. 
We also see that the charge of the electron (or muon) is corrected by a term 


of order « 
d 
Cetectron = € (1 ape I 2) (14.25) 
mJ q 


but with a coefficient of infinite magnitude! 

What to do? This is the problem of diverging quantities noted earlier in 
the ABB theory. The results of quantum field theory indicate that physical 
quantities are perturbatively corrected in powers of the coupling a. Sometimes 
these corrections are finite (as with the magnetic moment), but sometimes 
they are not. For the electron charge this problem is dealt with by regarding 
the charge e as not being directly measurable — instead, what is measured in 
the lab is the quantity e above. It is the quantity e that is defined 


electron electron 


*Experimentally the magnetic moments of the neutron and proton differ widely from this 
value. As we will see beginning in Chapter 16, this implies that these particles have internal 
structure. 
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FIGURE 14.4 

Lowest-order diagrams contributing to the anomalous magnetic moment of the 
electron. Only the interaction terms the arise from computing the diagram 
have been written. 


to agree with experiment. The quantity e (the bare charge) is assumed to be 
divergent in such a manner as to render Esectron finite. 

As with the ABB theory, this crazy idea works because QED is renormal- 
izable: once this redefinition of the charge is made (plus a similar redefinition 
of the mass and the wavefunctions), all other quantities in QED (e.g., cross- 
sections, decay rates) are finite and uniquely calculable. One of these quan- 
tities is the magnetric moment, given above, which to this order is predicted 
to be g = 2 (1 + &) = 2.00282... 

Can we test this? The answer is yes, and the comparison between theory 
and experiment is nothing short of spectacular [126]. The above procedure 
for computing the magnetic moment has been carried out to 5th order in a. 
These computations involve ~1,000 diagrams. The result for the electron is 
[1277] 


(222) m > — 0.328478444 ey + 1.181234017 ey 


—1.7283 (y - 0.0 c) + 1.71173 x 1072... 


= .0011596521884 + 0000000000043 
1, 159, 652, 188.4(4.3) x 10? (14.26) 


The first term is represented by a Feynman diagram with one closed loop, 
and requires no more than a page or two of hand calculation. The second 
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term represents seven diagrams, and originally took seven years to calculate. 
Seventy-two Feynman diagrams are needed for the third term, and all of 
them have been evaluated exactly using symbolic manipulation programs on 
computers, after nearly thirty years of hard work. The fourth term requires 
the evaluation of 891 four-loop Feynman diagrams, and has been estimated 
numerically using large-scale computations on supercomputers. The fifth term 
is even more laborious, and the coefficient 0.0 has an error of +3.7. The last 
term (not involving the fine-structure constant) is a small correction caused by 
particles in the loop other than the electron, and strong and weak interaction 
corrections. 
The particle data book lists the experimental value as [1] 


mE QED-expt 
(5) = 1,159,652, 181.11 + (0.074) x 10? (1427) 


e 


which is a world average over all experiments. This agreement between the 
theoretical value above and the experimental value(s) given here is the most 
precise for any non-gravitational theory and experiment to date. 

To properly compare experiment and theory, we need to know the value of 
the fine-structure constant. The measurements of the value of this constant 
aren't as accurate as those of (g — 2)! There are many ways of doing this 
[127]. For example one can measure a using the quantum Hall effect, get 
1/137.0360037(27) (an accuracy of 0.020 ppm), and predict 


= 1,159,652,156.4(22.9) x 10? (14.28) 


y — 24 QED/aH 
Us) 


€ 


One can measure o using the ac Josephson effect to be 1/137.0359770(77) 
(0.056 ppm), and predict 


(2 y a 


a = 1,159, 652,378.0(65.3) x 1071? (14.29) 


e 


Or, one can measure Planck's constant Á and the mass of the neutron, and 
derive o to be 1/137.03601082(524) (0.039 ppm) to predict 


_9\ QED/n 
(5) = 1,159, 652, 092.2(44.4) x 10? (14.30) 


e 


The numbers in parentheses are due to the uncertainty in the experimental 
value of a; the errors in the computer-measurement of the theoretical formula 
is much smaller (plus or minus 1.2). If you trust QED, you can work backwards 
to an even better estimate of o: 1/137.03599993(52), with an estimated error 
of 0.0038 ppm. 
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For the muon the result is 


QED 2 3 
g-2 IH. Q 
(>) = Š —0.765857408 (2) + 24.05050959 (5) 


+130.9916 Sl + 663 or + 709.4 x 10719 (14.31) 


where the last part is due to strong and weak interaction corrections. The 
best theoretical calculations yield 


pure 
QED 
including strong 


E e 2) QED 1,165,847, 181.0(1.6) x 107*2 


2 Jy 1, 165, 917, 880(20) (460) (350) x 10-12 


& weak corrections 

(14.32) 
where in the top line the quantities in brackets indicate the error due to un- 
certainties in the fine structure constant, and in the bottom they indicate the 
errors due to electroweak, lowest-order hadronic, and higher-order hadronic 
contributions, respectively. The best data [1] indicate that 


g-2 expt 
(23) = 1,165, 920,800(540)(330) x 1071? (14.33) 
2 Jy 
where the errors in brackets are due to statistical and systematic errors re- 
spectively. 

As you can see, strong and weak interactions provide significant correc- 
tions that are still not fully understood to 12 decimal place precision. The 
discrepancy is 


A ES = Aa, = 292(63)(58) x 1071! (14.34) 


which is 3.4 standard deviations away from the expected result. 

Experiment E821 at Brookhaven [128] have indicated that there is some- 
thing we don't understand about the anomalous magnetic moment of the 
muon. The current situation (as of June 2009) is described in figure 14.5. 
Notice that theory and experiment do not agree within limits of error: the 
present experimental value differs from that predicted by QED, or more prop- 
erly the Standard Model, since relevant weak and strong corrections have been 
included. The uncertainty on the Standard Model theoretical values is dom- 
inated by the uncertainty on the lowest-order hadronic vacuum polarization. 
This contribution can be determined directly from the annihilation of e*e^ 
to hadrons through a dispersion integral (the lower open circle point). The 
indirect determination using data from hadronic 7 decays, the conserved vec- 
tor current hypothesis, plus the appropriate isospin corrections, appear to 
improve the agreement between theory and experiment, but nobody knows 
why. 
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FIGURE 14.5 

Results for calculations of the anomalous magnetic moment of the muon, 
subtracted by the central value of the experimental average, given in by the 
bottommost point on the right. The shaded band indicates the experimental 
error. The Standard Model predictions (DEHZ, DH, EJ) are from various 
groups doing calculations that take into account different ways of incorporat- 
ing hadronic corrections. Image courtesy of Particle Data Group [209]; used 
with permission. 


A definitive understanding of the muon g factor will have to await further 
refinements of both the experimental and the theoretical values. Major ad- 
vances may be some time in coming. On the experimental side, the E821 
project has been shut down by the United States Department of Energy, at 
least for the time being. From the theoretical viewpoint, the next major stage 
will require evaluation of the five-loop Feynman diagrams. There are 12,672 
of those. Anyone eager to do this calculation? 


14.2.3 Lamb Shift 


The Dirac equation can be solved exactly in the presence of a Coulomb po- 
tential, where A, = (42,0). The computation is a tedious but analogous 
extension of the computation of the energy levels for a Hydrogen-like atom 
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using Schroedinger's equation. For the Dirac case, the energy levels are also 
fully and uniquely calculable and the result is [129] 


2 n = principal quantum number 
E= Mye 213 5 
= (Za)? J = 9990 90-°° 
14 2 — total angular momentum 
[n- G-1/2)-/G-1/2? -(Za?] 
zd. d (Za)? , 3(Za) (Za , (14.35) 
nM 2 m 8 m  m(d-- ' 


where m, is the reduced mass of the electron and Ze is the charge of the 
spinless nucleus. The expansion is in powers of Za, and the first non-trivial 
term is the usual part that one gets from the Schroedinger equation. 


te Triebwasser et al. [8] 
Robiscoe and Shyn [9] 6 
Shyn et al. [10] 

Safinya et al. [11] 

Cosens and Vorburger [12] 


H Kaufman et al. [13] 
Newton et al. [14] --S 
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Theory (0.805 fm) 
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FIGURE 14.6 

Comparison of theory and experiment for the Lamb shift of hydrogen. Open 
circles (o) denote direct measurements, and crosses (x) denote indirect mea- 
surements dependent upon the 2P; > lifetime of 1.5961887 (15) x10-? s. The 
measurement “This work" is by van Wijngarten et al. Lamb-shift measure- 
ment in hydrogen by the anisotropy method Can. J. Phys. 76, 95 (1998), 
fig. 3 pg. 102 [131]; other results appear in ref. [133]. Copyright 2008 NRC 
Canada or its licensors. Reproduced with permission. 
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Note that there is no explicit dependence on either the orbital angular 
momentum £ or on the electron spin s; instead the energy levels just depend 
on the total angular momentum j = + s. Hence states with differing £ but 
the same j will have the same energy — for example, the 25,/5 (ie. n = 2, 
£ = 0) and the 2Py/2 (ie. n = 2, £ = 1) are predicted to be degenerate. 


Hence 


Dirac eqn 
(Eos, /5 m Esp, 2) = 


which upon comparison to experiment [131] 


0 (14.36) 


(Bos, , — Esp, 2) P" = 1057.852 + .015 MHz ~ 4.374 x 10-9eV — (14.37) 


indicates a small but clear discrepancy. 

This discrepancy is explained by quantum-field-theoretic effects in QED: the 
self-energy of an electron bound to a nucleus depends on the orbital angular 
momentum £ and causes the above shift, first measured in 1947 by Willis 
Lamb and Robert Retherford [132]. The (rather long) calculation gives to 
leading order [130] 


QED mo? m (Esp) 91 
( 25175 2P1/2) 67 (1 ES (Eos) iP 120 ds : 
(14.38) 


an early triumph of QED! 

Other tests of QED involve investigation of spin-flip transitions in the 
ground states of positronium (e*e”) and muonium (u*e^). There is an 
ongoing effort in atomic physics to push for ever-greater precision in con- 
fronting QED with experiment to see just how accurate the Standard Model 
is. 


14.2.4 Running Coupling Constant 


A test charge inside a polarizable (dielectric) medium is shielded by the di- 
electric and so has a smaller Coulomb potential than in free space. This is 
a well-known effect in crystals. In a quantum field theory such as QED, the 
vacuum itself is a polarizable medium, as illustrated in figure 14.7. A charged 
particle can emit and reabsorb virtual photons which themselves emit and 
reabsorb ee” pairs which shield the particle: this is called vacuum polariza- 
tion. 

We observe a charged particle from large distances when it is “fully shielded.” 
As we probe closer and closer to the “core” of the particle (i.e. within a Comp- 
ton wavelength), shielding becomes small and the potential due to the *bare" 
charge is observed. 

How do we actually measure this effect? Recall that if we have 2 charged 
particles of charges Z¡e and Zoe that the lowest order scattering diagram is 
given by fig. 14.8 for which the matrix element is 
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FIGURE 14.7 
Schematic diagram of vacuum polarization. 


FIGURE 14.8 
Lowest order QED scattering. 


Z Zoe 
(pj — pı) 


M = — [u(pi)y" u(pi)] 5 [u(po)y,u(pz)] (14.39) 


However, higher-order corrections modify this. One of the most interesting is 
shown in figure 14.9, which contributes 


et 
Abe Ta OTP (14.40) 


6M = —i [u(p1)y u(p1)) ———3á 
(pi zm pi) 


to the matrix element, where the quantity Z,,, is, using the rules for QED, 


dtk Tr [y, (E - m) s, (d — E 4- m)] 
d res 
jo uen (=E? =m?) 


2 A 2 
Ta q A dx q i " 
T 009m 12,2 inf. T r( Z) F guay K (q?) (14.41) 


m 
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with f (-5) and K(q?) calculable functions, and m is the mass of the 
particle in the fermion loop in fig. 14.9, here taken to be an electron. The 


FIGURE 14.9 
A higher order correction to the process in fig. 14.8. 


K(q?) contributes nothing to M, since it is proportional to q,,q, and so is 
annihilated by the [u^] terms. For example 


gap) ups) = Tlp, )gu(oi) = ütpt) (p — p, ) wi) 
= Up) (m — m) u(pi) = 0 (14.42) 
(recall (y — m) u(p) = 0 for any u-spinor). So the full matrix element is 


Mot = M + ôM 


= CE [u(p1)y" u(pi)] [u(p5)y,u(p2)] 
(pi —p1) 


2 Ad 2 
x (5 jin [E 1(-5)|) (14.43) 


If we take the limit A — oo we obtain an infinite answer. However, if we 
define a renormalized coupling via 


= 14.44 
eR =e (1-3 1972 m fe ey ( ) 


then (since (p, — pi)" = q“) we obtain to this order in e? 


Miot = = iuo] R (1+ Es (E) ) tomates) 
Za Lach 


q? E = SBS (-£) 


= — [u(p)1“u(p)] ) [u(ps)yuu(po)) | (14.45) 
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where we can interpret the terms Zier [u(p )y"u(p1)) = J” and Zaen [u(pa)y"u(pa)) = 
TY as the currents associated with the 2 charged particles. 

Let's see how to interpret this. Suppose that the two charged particles are 
approximately static, and each is spin-up. Then setting € = €“) and E ~ m 
in eq. (11.33) gives [u(p1)o^u(pi)] zz /2m4 and [u(p5)o"u(p3)] ~ y2m2, and 
q! = (p, — m)" = (Ej — E1, py — pi) = (0, pi — p1) = (0, q). In this case the 
Fourier transform of eq. (14.45) is 


M tot = LiZav 2miv 2m2V (r) 


where r = |r] = |, — 72| is the separation between the two charged particles, 
and we can interpret the function 


d3q e2 e dT 
Vn J (27)? lg? (1 : “Rf (22) 


dq e dT e? 
~ 2 = 14.4 
ej Go e aA er 


as the potential between them, where the last expression is obtained by re- 
taining only the lowest order in e. Including the next-order loop corrections 
[134] and Fourier transforming gives 


9 D] 
CI E E h 200] (14.47) 
T T 37 
e2 
where a = zÈ- is the familar fine-structure constant and 
?  2u?41 
Q(r) =| du— T v u? + 1e™?™ + O(a) 
1 U 
ln (+)- yr -Žž + mr «& 1 
EX CE e 14.48 
| Re Í Li de : i 


with yg = .5772... which is Euler’s constant. 

So for large r > 1/m, the quantum corrections Q(r) to the Coulomb po- 
tential fall off exponentially, and V(r) ~ —%, which is the usual Coulomb 
potential. However, for r < 1/m, the quantum corrections Q(r) grow loga- 
rithmically — in other words, V(r) increases with decreasing distance! 

We can express this phenomenon by modifying the electromagnetic coupling 
a — a(r). We say that the coupling varies (or runs) with decreasing distance, 
or increasing energy q?. For QED 


(14.49) 
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and so the running coupling increases with decreasing distance. In fact, it 


becomes singular at Q(r) = 3%, or at 
h 
r~ exp EA = 3.86 x 10729! em (14.50) 
mc a 


In other words the quantum potential of the electron becomes infinite at a 
non-zero distance (albeit a very tiny one) from its center! 

This singularity is called a Landau singularity, after the Russian physi- 
cist Lev Landau who argued that its presence in QED indicates that strong 
vacuum polarization effects screen the electric charge completely at short dis- 
tances [135]. Others, including Shirkov [136], have called it the Landau ghost 
contending that it instead indicates the internal inconsistency of quantum 
electrodynamics, since this singularity cannot be removed by renormalization. 
Shirkov’s viewpoint has by and large been the more persuasive one, with the 
singularity regarded as the distance scale in QED at which perturbation the- 
ory definitely cannot be trusted, though whether or not it implies that QED 
is fundamentally inconsistent is unknown. In any event, effects from quantum 
gravity become relevant at 10-??cm, a much larger distance from the center 
of the electron than where the Landau singularity is. 


alr) A 


o=1/137 
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FIGURE 14.10 
Qualitative behavior of the running coupling constant in QED 


Although I have given the dependence of a as a function of distance, it is 
actually common practice in particle physics to express the running of a as a 
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function of the momentum transfer q?. The result is 


(14.51) 


where zf = 57, Q? is the sum of the squares of the charges (in units of the 
electron charge) of the fermions that contribute for all energies less than or 
equal to viel . For example, for energies less than 100 GeV, all leptons and 
quarks except for the top quark will contribute to the sum, which equals 2 
(since we must include each color of quark as a separate charge). The quantity 
p is an arbitrary energy scale that cannot be fixed in QED. 

At low energies |g?| — u? — 0, the logarithm in the denominator vanishes, 
and a(q?) > a (u?) — a (0) = 35. A high-precision determination of a at 
the Z-mass yielded [1] 


a 1(M2) = 128.936 + 0.046 (14.52) 


where I have expressed the result in terms of o! for convenience. 


14.3 Questions 


1. Show that ; , 
le: (5-eA)] = |z- e4] EN 


Hint: remember that p — —ihV and that the vector potential A depends 
on space and time. 


2. (a) Calculate to lowest order the spin-averaged spin-summed matrix 
element for the process e^ + y — e^ +7. 


(b) Your answer will depend upon the polarizations (or the 
respective initial and final photon states. Show that 


2 2 
5 (e . 220 = 1 + cos? 0 
A=1 


where 0 is the scattering angle. 


(c) Use this to obtain a differential cross-section that is averaged over 
initial photon polarizations and summer over final photon polarizations. 
Work in the lab-frame. 


270 


An Introduction to Particle Physics and the Standard Model 


Note: This is a tedious and somewhat lengthy calculation. I recom- 
mend that you make use of the simplifiying tricks given in table 13.1, 
as well as average over initial fermion spins and summing over final 
fermion spins. You will also find it helpful to choose the polarizations 
of the initial and final photons to be orthogonal to the initial electron 
momentum. 


(a) Calculate to lowest-order the spin-averaged spin-summed matrix el- 
ement for the process y + y — e7 + e*. 


(b) Find the differential cross-section that is averaged over initial photon 
polarizations and summed over the electron /positron spins. Work in the 
CM frame. 


Show that 


d3q e id emir] 
ES lg? --m? — 4x7] 
where m is a constant. What does this become in the limit m — 0? 
Given that a~'(M#) = 129 where Mz = 91 GeV, find the value of 


the fine structure constant at 4/|g?| = 1 TeV, assuming that no other 
particles exist beyond the known quarks and leptons. 


Suppose the fine structure constant is found to have a value 10 times its 
low-energy value at væl = 2 TeV. How many new quarks and leptons 
must there be at energies between 100 GeV and 2 TeV if the pattern of 
generations in the Standard Model is upheld? 


15 
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Strong interaction physics first began with Rutherford's discovery of the nu- 
cleus in 1910 [18]. The solar-system model of the atom that emerged from this 
discovery soon gave way to Bohr's theory of the atom, and then to quantum 
mechanics. Rapid progress was made in understanding atomic physics during 
this period. However, for 20 years after Rutherford's discovery the structure 
of the nucleus remained quite mysterious: it had to be positively charged, but 
why didn't electromagnetic forces cause it to explode apart? In 1932 James 
Chadwick discovered the neutron [137]: this led to the realization that the 
nucleus is a composite object, bound together by a nuclear or “strong” force, 
one much more powerful than the electromagnetic force. The strong force — 
by definition — is experienced only by particles referred to as hadrons*. 

Understanding the nature of the nucleus and its structure was a long and 
arduous process of calculation, experimentation, and intelligent guesswork. 
'The subject of nuclear physics today is indeed a complicated subject, since it 
is a many-body problem that is still not fully understood. Our interest here 
is in understanding the elementary constituents of nuclear matter, which are 
known as quarks. In order to appreciate the origin of quarks (both in their 
conception and in their discovery), we'll proceed historically, looking at the 
development of key concepts as they occurred. 


15.1 Range of the Nuclear Force 


Shortly after Chadwick's discovery, Wigner realized that nuclear forces must 
have a very short range (of only a few fm, where 1 fm—10- cm) and must 
be very strong [138]. The reason for this came from a consideration of the 
binding energies of the deuteron (hydrogen with one extra neutron), tritium 
(hydrogen with two extra neutrons, sometimes called triton), and the alpha 
particle (helium). As table 15.1 indicates the binding energies are millions of 
times larger than the electromagnetic energies that bind electrons in atoms. 


* Actually there are two nuclear forces — a strong force and a weak one — but the distinction 
between them did not become clear until the 1950s. 
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TABLE 15.1 
Binding Energies of Small Nuclei 


Binding Energy in MeV 
# of Bonds ‘Total per Particle per Bond 


2H 1 2.2 1.1 2.2 
3H 3 8.5 2.8 2.8 
“He 6 28 7 4.7 


Table 15.1 displays the binding energies of these three nuclides, as well as the 
energies per particle and per bond. The energy per bond is not roughly the 
same, and so something else is needed to explain the large increase in binding 
energy. Wigner reasoned that a short-range force can explain this, because as 
the number of bonds increases the nucleons pull closer together, and thereby 
experience a deeper potential well. This gives an additional increase to the 
binding energies per particle and per bond. 


The mass and binding energy for “He were observed to be almost the same 
as for ?H — this provided evidence that the forces between any two nucleons 
are the same, apart from small corrections due to electromagnetic forces, an 
empirical observation for which there is overwhelming evidence today. 


15.2 Isospin 


At about the same time, Heisenberg [139] suggested that the proton and neu- 
tron are really 2 different states of one particle called the nucleon, analogous 
to the way that the spin-f e^ and spin-| e” are 2 different states of the elec- 
tron. A comparison appears in table 15.2. Just as we can write an electron 


TABLE 15.2 
Comparison between Spin and Isospin 


Spin-1/2 particle Nucleon in 
in ordinary space in Isospin space 


up proton 


down neutron 
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wavefunction as a linear combination of spin-} and spin-|, this perspective 
implies that we can write the wavefunction of a nucleon |N) as 


IN) = slp) + $ |n) (15.1) 


i.e. as a complex linear combination of proton and neutron wavefunctions. 
This seems strange: the charges of the proton and neutron differ, and so we 
don’t know how to define the charge of the state |N}. However, Heisenberg’s 
hypothesis entails that the nuclear force is independent of the electric charge. 
We therefore can combine the two states, ignoring the effects of electromag- 
netism, which are much smaller. To every such state there will also be an 
orthogonal state |P) : 


|P) = a |p) + B |n) (15.2) 


(m) Glwa) e 


where the matrix U must be unitary, 2 x 2 and of determinant one (i.e. 
aÇ — cB = 1) to conserve probability. In other words, Heisenberg's idea is 
expressed by saying that all nuclear interactions should be invariant under 
SU(2)-transformations of the neutron/proton doublet! 

We call such transformations isospin transformations. Just as we did for 
rotations, we can write a general SU(2)-transformation matrix U as 


so that 


U= exp |-i - 3] E ee where aG—csB=1 


Es 
= {U (a, 8,5, C)] = SU(2) (15.4) 
where there are 22 — 1 = 3 generators J = (3,,32,33). As noted in our 


discussion in section 3.5 of Lie groups and Lie algebras, by expanding in 
powers of small B you can show that the generators obey 


(Ta, To] = ¡cave Te (15.5) 


which you might recognize from eq. (3.10) in Chapter 3 as the Lie algebra 
of SO(3). These two different groups - SU(2) and SO(3) - have the same 
algebra!, so we can use the same group-theory arithmetic to describe nucleon 
wavefunctions. 

Hence isospin is a quantum number analogous to angular momentum, ex- 
cept that it “rotates” protons into neutrons (and vice versa) instead of spin-T 
into spin-|. A proton is the positive projection of a nucleon state onto the 


tI emphasize that this is a very special property of these two particular Lie Groups — in 
general two different Lie groups have distinct Lie Algebras. 
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z-axis in isospin-space, whereas a neutron is the negative projection of a nu- 
cleon state onto this axis. A projection onto any other axis will be some linear 
combination of the proton and neutron states. 


The isospin operator is denoted by 3. A particle undergoing nuclear inter- 
actions will thus be in a state V = |i, is), where 


yy = (3: 3) V =i(i+ 1)V (15.6) 
34V = iz (15.7) 


So the proton and neutron are the isospin states 


»- (5) -|s.3) (15.8) 
(es 


ie. the nucleon has total isospin i = 1/2. The 3? operator identifies the 
total amount of isospin, and the 33 operator identifies which component total 
isospin the state has — in other words, whether it is a proton or a neutron. 

Of course we need to eventually take electric charge into account. This is 
done by via the formula 


(15.10) 


(duse 0 for the neutron 


B { +1 for the proton 
2 


where the electric charge Q of a nucleon state in units of e and B is a quantum 
number called baryon number. We set B — 1 for the nucleon in order for this 
formula to work. Antinucleons (antiprotons and antineutrons) have B = — 1. 
Note that this formula defines baryon number — at this stage it does not lead 
to any new predictions. 

In strong interactions, isospin is a conserved quantum number. The earliest 
evidence for this came from nuclear physics experiments that indicated that 
np, pp, and pn forces were all identical (except for much feebler electromag- 
netic effects). For example, energy level spectra in mirror nuclei nuclei which 
change into each other under pn interchange are almost identical: e.g. 13C 
and I??N or "Li and " Be or YB and C. 

Isospin invariance therefore means that the isospin operator 3 commutes 
with the Hamiltonian Hy governing the nuclear forces 


[Hw 3] =0 (15.11) 
The presence of electromagnetism destroys this invariance, which we express 


as 
ES g 35543] £0 (15.12) 
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However, since electric charge is conserved, then we must have the charge 
operator commuting with the total Hamiltonian, which gives 


[Hy + Hem, Q] = 0 => [Hn + Hem, 33] = 0 (15.13) 


where the last equality holds provided the baryon number B is a conserved 
quantity. This means that the 3rd component of isospin is conserved even in 
the presence of an electromagnetic interaction. This situation is analogous 
to having the z-component of spin preserved in the interactions between a 
spinning particle and a magnetic field. 

As with spin, we can combine i = 1/2 states to form states of higher isospin. 
For example, two i = 1/2 states can form either an i = 1 triplet or an i = 0 
singlet: 


triplet (15.14) 


ji = 1,is = —1) = |1, 22) |1, 23) = |n) |n) 


li = 0,88 = 0) = 25 (3,2) 13-3) 
singlet (15.15) 
= Js (lp) In) — In) [p)) 


The singlet state is the deuteron, D: a stable bound state of a proton and 
neutron. It is the nuclear analog of para-positronium. We could call it para- 
deuterium, and the triplet state would then be called ortho-deuterium. How- 
ever, since the triplet state is unstable we drop the “para” and “ortho” labels 
and call the singlet state the deuteron. 

In 1947 another strongly interacting particle called the pion was observed 
[140]. In fact there were 3 pion states: m*,T” and 7°. The first two had 
identical mass, and the last of these had almost the same mass as the first 
two. This suggested that these 3 particles formed an isospin triplet. The 
charge formula 


B +1 for at 
Qua = Ist — O for T? (15.16) 
—1 for m7 


= As far as we know, baryon number is conserved — there is no experimental evidence sug- 
gesting otherwise. Yet many physicists think that perhaps baryon number is only approxi- 
mately conserved, and that the proton (being the lightest baryon) can actually slowly decay 
into other particles. Such processes are permitted in grand unified theories, and may be 
important in the early universe [8]. 
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implied that B = 0 for the pion — indeed, it was the discovery of the pion 
that led to the introduction of baryon number B so that Heisenberg’s formula 
could be satisfied in a more general context. 

These assignments contain enough information to make predictions in scat- 
tering experiments using isospin conservation. For example, consider the two 
nuclear reactions 


(A) p+p—d+mr* and (B) ptn—d+n° (15.17) 
Reaction (A) has i = 1 in both the initial and final states, whereas reaction 


(B) has i = 1 in the final state, but is a 50/50 mixture of i= 1 and i= 0 in 
the initial state. So we expect the cross-section for (B) to be half that of (A): 


o(p+n—d+n)=Z0(p+p d+r*) (15.18) 


which is indeed observed [141]! 
As another example, consider pion-nucleon scattering, t + N — s 4- N. 


There are six possible elastic scattering processes: 
+ +4 3 
c ao a both have i = 2 (15.19) 
T n T n 2 
T +Fp_>T +p 
B 0 
T p T +n 2001 
MSAN UA all have i = (15.20) 
nt+n n? +p 


along with their time-reversed counterparts. To analyze these, we need to 
write the pion-nucleon states into isospin irreps, a job easily done using the 
18 i Clebsch-Gordon table from figure ??: 


rin = [5.) 

ene iles) 13/23) 

ew Kee) e 
km = S73) - Vs 72) 

ew dl oF) 

om =|3,-3) 


SYou can easily check these assignment by noting that for the top two reactions lis] = 3 
on each side, whereas ig] = i on each side for the rest of the reactions. 
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The cross-section is proportional to the square of the matrix element for this 
process, Mal = |(f |H;n| l^ where Hyn is the relevant part of the Hamil- 
tonian describing the nuclear force (neglecting electromagnetic interactions). 


If we impose isospin conservation, then | Hy, 3 — 0, which implies that 


3 
3 
ape 


meaning that the total isospin cannot change in any scattering process. Defin- 
ing (i = 3| HN p= 3) = M3 and = i| HN li = 3) = Mı, we see that 


i- 3) =0 (15.22) 


3 3 
(xt p| AN |x*p) = (n7n| AN |^ n) = el HN B = M3 (15.23) 


whereas 


(n7 p| Han |n p) 


CHOEEPHOREIILSTCH SS EPHHED) 


1/3 3X. B74 1 
5 |) +s en f) 


(pl AN Inn) 


CA) 


S te) 5 Gar a) 


3 \2 2/ 3 M2 2 
2 2 
=m- EM (15.25) 


Summarizing, we have 


(xt p| Hn |n p) = (n| Hav |n) = Ma (15.26) 
1 2 

(17 p| Han [17 p) = 5 Ms + Ma (15.27) 

(n7 p| Han |n?n) = eun — 2 (15.28) 


and so 


o(nt+p—nttp)=o(n +n — 0 +n)=A|Ms|” (15.29) 


278 An Introduction to Particle Physics and the Standard Model 


o (17 +p— +p) = A| EMs Ma 
1 
= zf |M; + 2M1|* (15.30) 
2 2 
o (n +p—TD+n)=A va M Y ut 
2 2 
E gh Ms - Ma] (15.31) 


where & is a phase-space factor common to all processes. Hence we obtain 
the following predictions for the scattering cross-section ratios: 


a(nt+p—at+p) : c(n +p— mc +p) o (nT c p— mn? tn) 
2 


Mı}? Mi 
=9: (142 | :2|1- — 
| M3 | M3 
"EF 
c 9:1:2 f |—|«1 
i pee 
M3 
c 0:2:1 if | =—— 1 15.32 
ME (15.32) 


These ratios do agree with nuclear scattering data [142]. 


Many experimental measurements have been made of both the total and 
the differential cross-sections for pion-nucleon scattering. The earliest exper- 
iments measured a collimated beam of charged pions traversing a target of 
liquid hydrogen. For both positively and negatively charged pions there is a 
strong peak in the total cross-section at a pion kinetic energy of 200 MeV at 
an invariant mass of 1236 MeV (see fig. 15.1), discovered by Anderson, Fermi, 
Long and Nagle in 1952 [143]. This is an example of a resonance as I dis- 
cussed in Chapter 9. The different resonances are given different names. The 
one at 1236 MeV is called the A(1236). The total cross-sections have a ratio 
of o(m+ + p)/a(x- + p) = 3, indicating that the J = 2 amplitude dominates 
at these energies, and in good agreement with the data once we realize that 
we must add the cross-sections for c^ + p to go to either of its possible final 
states. 


Note that in general there will be several different amplitudes contributing 
to the total cross-section in any given energy range, which means that we 
can't always interpret a peak as specifying a unique resonant state. 
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FIGURE 15.1 

Variation of the total cross-section for 7* and 7~ mesons on protons as a func- 
tion of the invariant mass. The positions of only a few of the known states 
are given. Image from Figure 4.4 of Introduction to High Energy Physics, Ath 
edition, D.H. Perkins, Cambridge University Press (2000); used with permis- 
sion. 
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15.3 Strangeness 


As nuclear physics experiments achieved higher and higher energies, new res- 
onances called V-particles were observed. First observed by Rochester and 
Butler in 1947 [144], by 1952 many of them had been seen. They were pro- 
duced in large numbers (they had large cross-sections of mb) and had long 
lifetimes (about 10 !^s), and decayed into products in a manner that did not 
conserve isospin. What were these strange things? 

As an example, one V-particle was called the A-hyperon. It had no charged 
counterpart, suggesting i, = 0. However, it decayed via 


A—m +p (15.33) 


which has i3 = -i in the final state, meaning that the total final-state isospin 
cannot be zero. Furthermore, the lifetime for this process was Ta œ 10710 
sec, far longer than typical strong-interaction lifetimes of 1072 sec! 

This suggested that these V-particles — such as A's, 3s, K's — were pro- 
duced by some strong nuclear interactions but decayed via a different weak 
nuclear interaction (as Pais first proposed [145]). If this is so, such particles 


The same could have been said for the charged pions, except that neutrinos were associated 
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must be produced in pairs (to conserve isospin), but can decay as singletons. 
This is in fact what is observed — for example we see 


D + Kt YN —nNn+T 
T +p u°+K° and 39. na 70 (15.34) 
A+ K? 
but we never see 
o> + at 
T +p —X n+k0 (15.35) 
LET 


Gell-Mann and Nishijima suggested that these strange phenomena be de- 
scribed using the formula [146] 


Q =J; + E + 2 (15.36) 
2072 
where S is a new quantum number called strangeness. 

'The assignment of strangeness to a given particle follows from isospin as- 
signments plus some conventions. By definition the strangeness of the KT is 
set equal to +1. All nucleons and pions have zero strangeness. As new parti- 
cles continued to be discovered, isospin assignments were made on the basis 
of simplicity and near-degeneracy of masses of different particles when/if they 
occurred. Other assignments were made based on reactions that are assumed 
to conserve strangeness in production (but not necessarily in decay)!. From 
the reactions 


X--K* 
T +p— 4 094 Kk? (15.37) 
A4 K? 


we see that the X~ particle has S = —1. Clearly the antiparticles K^ and 
X* have S = +1. A strangeness-conserving reaction is 


p A 4 29 
S -1 0 -1 0 
J + 0 0 


implying from the above that K? has S — --1. This raises a bit of a puzzle — 
there are two Kaons with strangeness +1, but only one with S = —1. Gell- 
Mann proposed that there should be an antiparticle K? with S — —1. This 
particle was soon seen in the reaction m+ + p — p+ K? + Kt. 


with their decays, and it was then thought that weak interactions were associated only with 
neutrinos. For the A and its compatriots, the decay is purely hadronic. 

We now understand (and will see later) that strangeness is conserved in all strong inter- 
actions, but violated in weak ones. 
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15.4 Flavor 


'The assignment of strangeness and isospin quantum numbers soon became 
haphazard and chaotic. It would be much nicer to use some kind of group 
structure to organize things, the way that isospin did for the neutron, proton 
and pion. In 1963 Murray Gell-Mann [147] and (independently) George Zweig 
[148] did just that! They enlarged the SU(2) isospin symmetry to include 
strangeness — it became an SU(3) symmetry. Let's look at the basic outline 
of how they did it. 

Gell-Mann and Zweig noticed two things about the 10 lowest-mass spin-2 
baryon states: 


1. Allstates within the same isospin multiplet had approximately the same 
mass (to within a few percent) 


2. States of different strangeness differed in constant-mass increments 


'These features are easy to see if you plot the different spin-3 baryon states 
on a graph with strangeness on the y-axis and isospin-3 component on the 
a-axis as in fig. 15.2. For example all four of the A(1232) states had i = 3: 
each of them has strangeness S — 0, and about the same mass (namely, 1232 
MeV). The lowest-mass spin-3 baryon states with S — —1 were all in an 
isospin triplet (i.e., i = 1) and each had a mass of about 1384 MeV (these 
are called the X-states). The S = —2 states (the =*’s) each had mass ~ 1533 
MeV and were in an isospin doublet (ie. i= 3). 

One particle was missing in the pattern: an S = —3 state of spin-3. Since 
the mass increments are ~ 150 MeV, as isospin decreases, this state should 
have a mass ~ 1680 MeV and have i = 0. Gell-Mann called this particle the 
QT (since it should be the last particle to be discovered), and predicted its 
existence in 1961 — three years before its discovery [149]! 

Similarly, the eight lowest-mass baryons with of spin-4 had an eight-fold 
pattern. Here the mass increments were ~ (190 + 15) MeV: For example, 
My — My ~ 177 MeV, and Mz — Ma ~ 203 MeV - the increments are not 
quite as good as the spin-3 collection, but are still not too bad! (Note that 
there are two kinds of X-states: a spin-3 isospin triplet of mass 1384 MeV 
and a spin-3 isospin triplet of mass 1193 MeV). 


These regularities in strangeness and isospin were accounted for by Gell- 
Mann and Zweig by hypothesizing that there were three types of fermion 
constituents to baryons which Gell-Mann called QUARKS and Zweig called 
ACES. Gell-Mann's nomenclature won out over Zweig’s, and persists today. 
Each quark had a name: 
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FIGURE 15.2 
Isospin/strangeness relationships amongst the spin-3/2 baryon states. 


“up” — for is = +4 (and S = 0) 
“down” —foris = —i (and S = 0) 
s “strange” -fori—0 (and S = —1) 


a e 


Baryons were assumed to each be composed of three quarks. Each quark 
was presumed to be “one-third” of a baryon and so were assigned B — i. 
Since 


1 Y 
Q=3+5(B+S)=h+55Q= E ford (15.38) 


the quarks had fractional electric charge! This rather odd property of quarks 
prevented most physicists from accepting them as actual particles until about 
1972-74. The quantity Y = B + is called hypercharge, and its usage persists 
today. 

'The quantum numbers isospin and strangeness were redefined into what 
we now call flavor, and the group SU(2) (which transforms the definitions 
of up and down) was enlarged to become SU(3) (which now transforms the 
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FIGURE 15.3 
Isospin/strangeness relationships amongst the spin-1/2 baryon states. 


definitions of up, down and strange). This simple idea explained both the 
patterns and the mass-splittings of the eightfold collection (the OCTET) and 
the tenfold collection (the DECUPLET) of baryons. As S decreases (becomes 
more negative), more strange quarks are present and the mass increments 
arise because m, ~ 150 MeV. m,, and mg are presumed to be almost equal, 
explaining why proton and neutron masses are roughly equal. The isospin 
symmetry is an SU(2) subgroup of the SU(3) flavor symmetry, and it works 
so well because of the near-equality of up and down masses. 

We can now understand the octet and decuplet in terms of different col- 
lections of quark flavors (i.e. u, d, and s). Working non-relativistically, we 
expect that a baryon wavefunction is a bound-state wavefunction of three 
quarks and has the form 


Vota] = 9 (space) v (spin) x (flavor) (15.39) 


and that the SU(3) symmetry allows us to interchange any two quarks**. 
Since we regard quarks of different flavors as being different states of the same 
particle, the Pauli principle demands that V¿¿;.] be antisymmetric under the 
interchange of any two quarks. 


**Note that this SU(3) symmetry is an approximate symmetry: it would only be exact if 
the mass of the strange quark were equal to that of the up and down quarks. 
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Now a bound state of 3 quarks can be made from any 3 flavors, leaving 
3? — 27 distinct possibilities for x (flavor). However, we saw earlier that the 
8-fold and 10-fold collections (i.e., the octet and decuplet) appear to be rather 
special. The reason for this is that these collections are irreducible multiplets 
of the (approximate) SU(3) flavor symmetry. Let’s see how this works. 


One possible multiplet is completely symmetric under flavor interchange. 
Beginning with, say |uuu), we write down all possible states that we can reach 
from it by replacing one of the u’s with either an s or a d in all possible ways 
that respect the full symmetry (so that the state doesn't change when we 
switch the positions (not the flavors) of the quarks). For example we can 
get |ddd) by replacing each u with a d. There are 10 possible states we get 
from this procedure, and so we get a decuplet, whose wavefunction we call 
Xs, shown in fig. 15.4. 


(dud+ddu+udd)/ V3 (uud+udu+duu)/V3 


ddd 4-----= — $ uuu 
(ydstusd+dsutdustsdu+sud)/V6 
(dsd+dds+sdd)/V3 a » S quustusrtsa 


/ 
/ 
$ 


& 


(dss+dss+ssd)/V3 Q 9 (uss+uss+ssu)/V3 


SSS 


FIGURE 15.4 
The decuplet xs. 


Another multiplet must be one that is completely antisymmetric, in which 
the interchange of any two flavors yields a minus sign. There is only one such 
possibility, called xa, given in figure 15.5. 


Finally, there are also multiplets that are combinations of mixed symmetry 
which are antisymmetric under the interchange of either the first two flavors 
(x12 ), the last two flavors (x23), or the first and last flavors (x13 ). Each 
multiplet has 8 states, as illustrated in fig. 15.6. The mixed-symmetry multi- 
plet x13 does not appear in fig. 15.6 because it is not independent of the other 
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(uds-usd+dsu-dus-sdu+sud )/ V6 


FIGURE 15.5 
The completely antisymmetric flavor state x 4. 


multiplets 
X13 = X12 + X23 (since (X32 + X21) = — (x12 + x23) ) (15.40) 


So these are all the possibilities there are: 10 + 1 + 8 +8 = 27 states in 
total. We write this as 


38®3@3=10616868 
all possible products of _ 4 distinct multiplets that 
3 flavors 3 times ^. transform independently of each other 


'To get the spin part of the wavefunction, we just use the Clebsch-Gordon 
tables ?? twice to get 8494 = (160) @4 = (103) 6(0@4) = 26563: 


m Em b T 
TN (Pa (9 — 0540 
3,3) = IL 
1,1, = BUN) HD) IM 1 
22/12 V2 y i 
I — 3 = vg (1 = M1) n) E vl ed 
LD. = Slt) (11) -1) N 
2° 2/23 V2 i 
$2) = ya D (IN) = 1 »| NN eee 
where the 1 and | symbols denote spins with z-component +4 and -i re- 


spectively. Note that we have a fully symmetric spin-3 state and two mixed- 
symmetry spin-z states (as with flavor we have 13 = V1» + V», so the third 


mixed-symmetry state is not independent). 
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FIGURE 15.6 
The mixed-symmetry states x12 (top) and xoa (bottom). 


15.5 Color 


Recall that we need V,,,,, = ® (space) v (spin) x (flavor) to be fully antisym- 
metric because of the Pauli principle. Since we regard Yota] as the wave- 
function for a bound state of 3 quarks, the lowest-energy bound states will 
be the lowest-mass baryons. Such lowest-energy states will have no orbital 
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angular momentum amongst the quarks (since angular momentum increases 
the energy). This forces ® (space) to be symmetric under quark interchange 
(just as the helium atom in its ground state is symmetric under interchange 
of the electrons). 

So it seems we need the product wavefunction w (spin) x (flavor) to be fully 
antisymmetric so that Yota] will also be. Unfortunately this is impossible! 
For the decuplet, we must have w(spin) = Y, and x (flavor) = x, since 
experiment tells us there are 10 states with spin-3. However, these are both 
symmetric wavefunctions and hence their product is also symmetric. For 
example: 

JAt+) = |utut wut) (15.44) 


and we have 3 identical particles in the same state, violating the Pauli prin- 
ciple. 

This problem was resolved by Greenberg [150], who suggested that there 
was a new quantum number called color! He postulated that each flavor of 
quark (u, d, s) came in three distinct types, or colors — say red, green and 
blue. This hypothesis implies that 


Vio. = 9 (space) v (spin) x (flavor) v (color) (15.45) 


and so now we have the possibility of making Viota] antisymmetric by making 
y (color) antisymmetric under the interchange of flavors. In fact, we know 
how to do this: just take y (color) = pa, where y4 is given by x4 with the 
replacements u — r, d —^ b, s > g: 


1 
ETT 


'This proved to be an enormously fruitful idea, and today we understand 
color symmetry — which also happens to be SU(3) - to be the foundational 
symmetry of the strong interactions. Color is to strong interactions what 
electric charge is to EM interactions, except that there are three types of 
"strong charge": r, b, and g. There are also three types of anti-strong- 
charge: 7, b, and g. The state pa is a color singlet: no matter how we 
switch around the definition of color amongst the quarks, this state remains 
the same (up to a minus sign). The physical interpretation is that pa is the 
wavefunction for a color-neutral bound state of three quarks. 

The SU(3) color symmetry is postulated to be an exact symmetry because 
two given quarks of the same flavor but of different color have the same mass, 
charge, and all other quantum numbers. This is quite unlike the SU(3) flavor 
symmetry, in which quarks of different flavor do not have the same mass. This 
is perhaps the most confusing thing to deal with when one first encounters 
quarks: there are two mathematically identical symmetry groups operating 
in very different physical ways. To avoid confusion the color symmetry is 
often denoted as SU¢(3) and the flavor symmetry SU p (3). The role played 


(Irbg) — |brg) + |bgr) — |gbr) + |grb) — |rgb)) (15.46) 
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by SUc(3) color symmetry is completely different than that played by the 
SU (3) symmetry. To say that there is an SU¿(3) color symmetry is to assert 
that the strong interaction force is the same under any unitary transformation 
of color charge. The SU p (3) symmetry of the quark model, on the other hand, 
is an approximate symmetry that is used to classify the different hadrons that 
are made of up, down, and strange quarks. If this symmetry were exact it 
would mean that all hadrons of a given spin and baryon number would have 
the same mass. 

Because color respects an SU(3) symmetry, we can also generate color 
decuplets, octets, etc. Why don't we see these? Without any compelling 
experimental data, we append another postulate to our list, which states: 


[Every naturally occuring bound state of quarks is a color singlet 


which implies that we always make use of y4 in constructing hadron wave- 
functions. 


From these foundations we can build what is called the quark model of 
hadrons. 


15.6 Questions 


1. Do the following exist? Why or why not? 


(a) an antibaryon with charge 4-2 
(b) a positive meson with strangeness 4-1 
(c) a spin-0 baryon 


2. An alpha particle has zero isospin. Can the reaction D + D — a 4- s? 
take place? Why or why not? 


3. Consider the operator 
G = C exp (i232) 


an operator known as the G-parity operator, where C is the charge- 
conjugation operator and 32 is the 2nd component of the isospin oper- 
ator J. 


(a) Suppose we define for the pion 
cht) = - I) 


Show that under G-parity 
Glz) = —|n) 
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for the pion triplet. 
(b) What is the action of G on a state |W (nr)) of n pions? 


4. Show that all non-strange non-baryonic nucleon states are eigenstates 
of G. 


5. A glueball is a color-singlet bound state of two gluons, something ex- 
pected from QCD. The simplest glueball state |6) has positive parity, 
zero angular momentum, and is even under charge conjugation. 


(a) What is the G-parity of this glueball? 
(b) What is its strangeness? 


(c) Which of the following reactions are allowed 
(a) 6 Sat m7 
(b) 6 2 «9 4- 7° 


(c) 6 — T? 4-2? +r? 


assuming that the glueball is sufficiently massive so that energy conser- 
vation is satisfied for each? 


6. The strong interactions are invariant under charge-conjugation. For the 
following decays 


(a) P — nt n7 


(b) w => nrt +r +r’ 


how does this constrain the matrix elements (p1...Pm |M| p) for each case? 


16 
The Quark Model 


DOI: 10.1201/9781420083002-16 


The basic postulate of the quark model is that all hadrons are bound states 
of particles called quarks. Quarks must be fermions, since only fermions can 
give bound states that are both bosons (if an even number of quarks bind 
together) and fermions (if odd number of quarks bind together). 

The quark model was originally constructed using three kinds (or flavors) 
of quarks: up, down, and strange. Today we know that there are three more 
flavors of quark — charm, bottom, and top — making for six flavors in all. 
However, these last three flavors are much heavier than the first three, and do 
not play a role in our understanding of the lowest-mass baryons and mesons. 
So I will defer discussion of the heavier quarks to Chapter 18. 


TABLE 16.1 
Quark Quantum Numbers and Masses 


Q 3 3. S C B T B Y Mass (MeV) 


+0.96 
d -4 4 -i 0 0 0 0 4 i br 
u +2 4 1 00001 1 BT 
s -4 0 0 -1 0 0 0 4 -2 104 +38 
c +2 0 0 01001 I LUN 
b -4 0 0 0 0 -1 0 4 i 4, 200 * 5. 
t +2 0 0 0 0 0 1 ¿ i  171,200+2,100 


Table 16.1 lists the rather surprising properties of quarks: their electric 
charge is fractional, their baryon number is 1/3, and their masses* have no 
discernable pattern. By convention, the sign of the quark flavor is chosen to 


*The definition of quark mass is rather subtle, as we shall see in this and subseqent chapters. 
The list of masses in table 16.1 are current quark masses obtained from a broad range of 
experiments as discussed in the Particle Data Book [1]. 
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be the same as that of its electric charge. Direct searches for free quarks — 
in cosmic rays, in Millikan-type experiments, and in rocks from the Moon — 
have been fruitless. No free quarks have ever been observed [151]. This is now 
believed to be a consequence of the color force that binds them together. 

'The quark model reached perhaps its most sophisticated form due to the 
efforts of Nathan Isgur and Gabriel Karl, who showed that all of the low energy 
baryon and meson states could be understood as bound states of quarks using 
the principles of atomic physics [152]. While I can't present the full details of 
this model, I can give you its foundations and basic ideas [153]. Let's begin 
our study of the quark model by looking at the two main kinds of hadrons: 
baryons and mesons. 


16.1 Baryons 


A baryon is a hadron that is postulated to be a bound state of three different 
quarks. The bound-state baryon wavefunction is 


Vbaryon = 9 (space) v (spin) x (flavor) y (color) (16.1) 


where 


(color) = pa = Fa rto lbrg) + |bgr) — |gbr) + |grb) — |rgb)) (16.2) 


is the fully antisymmetric color wavefunction, ensuring that all baryons are 
color-neutral, since the SUc(3) symmetry leaves the pa state invariant. The 
spatial wavefunction ® (space) is a function of the positions of the quarks. It 
must be fully symmetric under interchange of quark position because it cor- 
responds to a state of lowest energy (i.e. we wish to describe the lowest-mass 
baryons), and from atomic physics we know that such a state has principal 
quantum number n = 0 — this is an S-wave state, which is symmetric. 

The Pauli principle requires Vparyon to be fully antisymmetric under the 
interchange of any pair of quarks. Since © (space) v (color) must be fully an- 
tisymmetric, this means that v (spin) x (flavor) must be fully symmetic under 
quark-pair interchange. 

So if all baryons are indeed bound states of three quarks, what we must 
do is combine the irreps we found in the previous chapter for y (spin) and 
X (flavor) in all possible symmetric combinations. Recall that we found 


SPIN: { dai. a FLAVOR: UO Xia x33, XA} 


where the superscript is a reminder as to which multiplet we are considering 
and the subscript denotes the symmetry/antisymmetry of the wavefunction. 
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All possible combinations of these wavefunction irreps that are symmetric 
under particle interchange will give us the lowest mass baryons. 

One obvious symmetric combination is vi ^x19: we must have a decuplet 
of spin-3. We get all of the explicit states by taking a direct product of the 


two wave functions. For example: 


ag) = Ime = lat wt wt) (16.3) 

Jars 5.5) = n e (Ae rro «i enm) 
=Fellututultlutuluty+lulutut) (64) 

Eu 2n = Sa us) + lusu) + ound) @ (7 (tt) + IHL) 0) ) 


(ututs|)+lutulst)+lulutst) 
tlutstul)+lutslut)+lulstut) (16.5) 
t|[stutul)+|stulut)+[slutut)) 


and you can construct all of the others by a similar procedure of taking direct 
products. 

The numerical coefficients allow us to compute probabilities of quark dis- 
tributions inside a baryon. For example, suppose we had a X** in a j; = > 
state. We see from eq. (16.5) that there are three torme in its wavefunction 
that have a spin-| s-quark. Each has a coefficient of i 3, and so we would 


have a G)? + (2) + Gy = i chance of finding a mon s-quark. Similarly 


there are six terms containing a spin-| u-quark and so there is a 6x G)? =% 
probability of finding a spin-| u-quark in the X**— if we were able to isolate 
the quarks! 

Another symmetric combination follows using the x1; and x33 octet wave- 
functions. Since vix, is symmetric on (1 — 2), we can form a fully sym- 
metric wavefunction by taking 


1/2 1/2 1/2 
vx - (vid Xia + bas X53 + wy} xis) 


=% (204 + Quah xs + bos da + ty E (16.6) 
where we recall that yi? = va + yi? and x54 = x3, x34, and N is 


some normalization factor to be determined for each state (since there will 
be cancellations between some terms when we look at specific spin/flavor 
wavefunctions). So we expect a spin-1/2 baryon octet! 

Note that the observed octet is a mixture of the two flavor octets — it must 
be to ensure that ~ (spin) x (flavor) is fully symmetic. Again, we find explicit 
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states by direct multiplication. For example, a spin-up proton is 


bs) - n [2 [o 00 umm] [o ua) — ie t| 


a imam - i| (lo (ua) - lau) 
«ime = um] | d - fe) [a] 
«dr - umi] [7510 d - 16) | baen 


which upon simplification becomes 


urs 
ig 


5) - a (2h dut) 2lul at ut) 24010100) 
+2ld|ufut+2lufufd]) 
—2luftuldf)—2\lutdftul)+2|ut dl u1) 
+lutdtul)—lutdluf) 
—|dtufulj)+|[dtulut)t+|utul dt) 


dale diu +luldtut)) 


-(aietatun-teietun-Mtulun 
42d utu 42lututd D -lutuldf) 
-lutdtul)-latutul)—lwlutay | (16.8) 


and the normalization coefficient can be obtained by recognizing that each 
state in the preceding expression is orthogonal to all the others. Hence the 
requirement that (p; i, ilp; Z, 5) = 1 gives 


bs) = (tatu eaten lett 


21d utut) e2luTutd[ -lutuldT) 
-htdtul)-MtutuD-leiutat)) (16.9) 


and so the probability of finding, say, a spin-down d-quark in a proton (where 
jz = +4 for the proton spin) is Y x (22 + 2? +2?) = 2 

Are there any other symmetric mixtures of y (spin) and x (flavor)? The 
answer is no: no other combination of mixed-symmetry wavefunctions can 
be made fully symmetric, and there is no fully antisymmetric state that can 
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be combined with xa. This agrees with observation: all lowest-mass baryons 
are either spin-3 and fit into a decuplet, or are spin-4 and fit into an octet". 

Constructing baryon wavefunctions is a straightforward, though somewhat 
tedious, job. There are three spins, three flavors, and three colors, and 
they all must be put together in a manner that satisfies the Pauli principle. 
The SU (3) flavor symmetry is approximate since the up, down and strange 
quarks have different masses (though the up and down masses are nearly 
identical). The SUc¢(3) color symmetry is believed to be exact: a red up- 
quark has all the properties of a blue up-quark or a green up-quark, apart 
from color. Color plays an essential (but hidden) role in all of this — without 
it we would also be searching for antisymmetric spin-flavor wavefunctions. 
The only way to do this for spin-2 is to combine it with the antisymmetric 
SU;(3) singlet x1, — this would mean there would be only one spin-3 baryon, 
in strong contradiction with observation. 


16.2 Mesons 


Another way to get color-neutral states is to combine a quark-antiquark pair 
into a bound state. Since there are 3 flavors, 2 particles and 2 possible spins, 
there are a total of 3 x 3 x 2 = 18 distinct such states. These states are called 
mesons. 

A meson wavefunction is of the form of eq. (16.1) 


V meson = 9 (space) v (spin) x (flavor) v (color) 


where the Pauli principle is no longer an issue, since the particles are not 
identical. The spatial wavefunction will still be symmetric (since we want 
the lowest-energy states, which means $ (space) is an S-wave) and the color 
wavefunction is guaranteed to be neutral (i.e. a singlet) since it is made from 
color-anticolor pairs. This color wavefunction is 


1 
73 = 5 


where each color is paired with its anticolor. 


(Ir) + |bb) + |99)) (16.10) 


TOf course excited states would be different because in that case y(space) is not necessarily 
symmetric since the quarks would now have some orbital angular momentum. In this case 
we can construct other combintations of (spin) and x(flavor). The only restriction is that 
the bound state wavefunction be antisymmetric under interchange of any two quarks (and 
that it is a color singlet). 
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'The spin wavefunctions are 


11,1) =111) 

11,0) = 5 (INL) +ID) > triplet Yr (16.11) 
|1, 1) = [1 

0,0) = 25 (IL) = L0) } singlet v8 (16.12) 


as we have seen before in eqs.(6.23, 6.24) for positronium. The spin-1 triplet 
state is the “ortho” state and the spin-0 singlet is the “para” state. These 
are the only ways of combining two spin-4 particles. Hence the quark model 
predicts that all (low-energy) mesons are either spin-0 (the scalar mesons) or 
spin-1 (the vector mesons). 

The flavor wavefunction must now be made from a quark and an antiquark. 
The 9 possible flavor states decompose into an octet and a singlet as in fig. 
16.1 where the minus sign is a (rather irritating) convention for expressing an 


oe” 
A (uz + : -255 IN | T (uir +dd +55 ) 


-ud 
x Q 9-u ¡ 9 


NE ü-di) / Y X 


FIGURE 16.1 
The octet and singlet meson flavor wavefunctions. 


isospin doublet of antiquarks. We have 


ja) = - 


i= =>) = 
t= p853 U) = 


and note that d is isospin-up (it’s the antiparticle of isospin-down, after all), 
whereas Ñ is isospin-down. In group-theoretic language we say that the three 


. 1. 1 
t= sh =-5) (16.14) 
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quarks u, d, s belong to the fundamental 3 representation of SU (3) and that 
the three antiquarks belong to the conjugate representation 3. The combina- 
tion of both we write as 3@3 = 8@1, meaning that we have a collection 
of 8 objects (the octet) that transform into one another under SU (3), and 
one object (the singlet) that just transforms into itself under SUp(3). You 
can easily check both statements by seeing what happens under interchange 
of any pair of flavors. 

For the spin-0 mesons we identify these flavor wavefunctions as in fig. 16.2, 
or in other words 


x é © @ r Th 


FIGURE 16.2 
'The spin-0 mesons. 


[r*) = - lud) ys 

le?) = — (um) - |d2)) v9 

|n) = |du) we 

|A+) = Jus) pe (16.15) 
|K°) = |ds) ws 

[&^) = - sd) 42 

K7) = |su) vf 
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for the pion and Kaon states?. 
The 7 states are a little more subtle: the flavor-spin states are 


luz) + |dd} + |ss)) Ys (16.16) 


Ins) = ~= (lua) + |dd} — 2|ss)) pe (16.17) 


Note that the no is a para-spin state and a para-isospin state (as is the ng). 
This is in contrast to the pion-triplet, which is a para-spin state but an ortho- 
isopin state. 

'The experimental situation is a bit more complicated since the actual states 
observed in nature are orthogonal linear combinations of the no and ns: 


Im) = cos 8, |no) + sin 6, |n8) (16.18) 
In) = — sin 6, [no) + cos 6, (ns) (16.19) 


Experiment indicates that the mixing angle 0, ~ 11°. Since the no and mg 
have identical quantum numbers, their wavefunctions can mix. However, the 
simple quark model here does not explain why they mix. The actual reason 
for the mixing has to do with the underlying theory of the strong SUc(3) 
interaction — quantum chromodynamics, or QCD — which induces continuous 
transitions between the quark-antiquark pairs in the y and y’. It is the task 
of this more fundamental theory to explain the origin of this mixing angle. 

For the vector mesons we have an analogous picture, given in fig. 16.3 
where now 


107) == [ud) br 
1°) = 3 (um — dd) v 
|07) = |du) Yr 
|K**) = lus) pp (16.20) 
|j?) = ds) wh 
[K^ =-|sd) v 
|&* em su) Yh 


*The minus signs in eqs. (16.14) and (16.15) are irrelevant phase conventions that appear 
for technical reasons [31], but they do have the effect of modifying our linear combinations. 


For example, a para-isospin state is 
1 Le edes 1\|. 1, 2) . 1 i 5) ae 3)) 
i , iz i i i Qd i Qi 
V2 237a gue QU prec 


= 75 (lu) ld) + 1) [u)) 
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FIGURE 16.3 
'The spin-1 mesons. 


in analogy with the spin-0 case. As with the 7 states, the Hp and ¢g states 
are 


(0) = — (qum) |dd} --|ss)) 4% (16.21) 
lós) = -7 (Iun) + |dd) — 2155)) v (16.22) 


and so we see that the p-meson is an ortho-spin and ortho-isospin state, 
whereas the Ho is an ortho-spin but para-isospin state. And as before with 
the spin-singlet case, the actual ¢-states observed in nature are orthogonal 
linear combinations of the $9 and $g states : 


1 S E 
Fa (lum) + |dd)) (16.23) 


|p) = — sin 04 |óo) + cos 04 |a) = |ss) (16.24) 


|w) = cos 65 |óo) + sin 65 |¢g) = 


where now experiment indicates that the mixing angle 0, ~ 35°. This makes 
the ¢-meson almost completely |ss) (explaining why it preferentially decays 
into KK pairs), and the w-meson 5 (Jua) + |dd)) (explaining why the masses 
of the p-meson and the w-meson are almost the same). The origin of the 
mixing angle 05 is beyond the scope of the simple quark model presented 
here, and requires QCD for a more complete explanation. 
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Mesons are hadrons with zero baryon number. It is straightforward to 
figure out their general properties under P and C. For parity, the orbital 
angular momentum l contributes as for the hydrogen atom. A fermion always 
has a parity opposite to that of its antifermion counterpart; by convention 
we set all quarks to have positive parity and all antiquarks to have negative 
parity. Hence P = Cie for a meson. Any system consisting of spin-4 
particle/antiparticle pairs will emit one photon if it (a) undergoes a transition 
between two states where the orbital angular momentum changes by one unit 
or (b) if one particle undergoes a spin-flip relative to the other particle. Since 
photons have C = —1, we have C = (—1)'** for flavorless mesons. Otherwise 
C is undefined. Summarizing: 


P =(-1)'*! — for all mesons 


C =(-1)'T* for flavorless mesons 


Many other mesons beyond the ground state scalar and vector mesons exist, 
and their quantum numbers are compatible with those of a spin- 3 particle- 
antiparticle pair in an excited state with nonzero orbital angular momentum. 


16.3 Mass Relations 


The differing masses between the various baryon and meson states can ap- 
proximately be accounted for by assuming that the strange quark has a mass 
ms ~ 150 MeV. For example in the decuplet we have 


My» — Ma ~ 152 MeV — Mz — My ~ 149 MeV 
Mo — Mz» ~ 139 MeV (16.25) 


which can be explained reasonably well by setting ms ^ 150 MeV, but for the 
octet we have 


Ma — My ~ 177 MeV Mz — Ma = 203 MeV (16.26) 


which is pretty crude. We should be able to improve on this by taking the 
quark model more seriously to include a better description of how the quarks 
bind. Since we are working with constituent masses, we have already included 
the energy of the gluons. However, we have not taken account of interactions 
due to quark spin. Specifically, if mesons and baryons really are bound states 
of spin-4 quarks, presumably spin-spin interactions between the quarks should 
play an important role. 

Why? Recall that the magnetic dipole moment for a fermion is of the form 


ji= 958 (16.27) 
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where g = 2 for a spin-5 fermion (modulo small QED corrections). Any two 
fermions will have a dipole-dipole interaction, whose energy is 


aipore RÍ Hz Ha (16.28) 


where rj is the separation between fermion 1 and fermion 2. In a hydrogenic 
atom this kind of interaction takes place between the electron and the spinning 
nucleus, and splits each energy level into two very close levels. Such a splitting 
is called a hyperfine splitting; for a hydrogen atom in the S-state we find that 
this interaction — when averaged over all directions — vanishes everywhere 
except at the origin. This gives 


8r 4rah? = z 
AEnp. = (H iste Vand cee = (=) ——— Se: Sp |Wn—0(0)|? (16.29) 


€ 
3 / MeMpc? 


where S = Ea, and the a is the fine-structure constant. Note that the energy 
is proportional to the square of the wavefunction at the origin due to the 
averaging procedure. 

Quarks carry a color charge, so we expect a color-magnetic-dipole interac- 
tion?. The color potential is proportional to 1/r (we'll see why in Chapter 
18), so we expect that for mesons 


Mmeson = (sum of quark masses) + (color dipole interaction energy) 


1/8TY 47a, = z 
= mq | Mg ( ) Sq j Sq IY meson (0 (16.30) 


3X3 / mmg 


where a, is the strong interaction constant, and the factor of 3 is due to the 
three quark colors. For baryons we expect 


| 2 


3 
1/87 ÁTOs 3 z 
Mbaryon = ma, + Mg, + ma, d 6 ( 3 ) > m Sq; * Sq, Vyaryon(0) 
i<j qi' "qj 


(16.31) 
where the factor of 5 is also due to quark color. 
We can work out the spin-spin terms as follows. Recall that 


O ae mg (1 Ss 
ANA = a E E 2s (16.32) 
2 12 4 
and that S, + Sá = add Hence 
in ad la a slips phe eu 
Sq : Sg [meson) = 2 [ Smneson 3 Siweson => Sy . Sq TVG" al [meson) 


$Of course quarks also carry electromagnetic charge, leading to an electromagnetic dipole 
interaction — this is negligibly weak compared to the color dipole interaction. 
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3 
Smeson (Smeson + 1) — 4 3 [meson) 


2 


) [meson(16.33) 


= 2 
— 3h. for smeson = 0 (scalar mesons) 


2 
i HR for Smeson = 1 (vector mesons 
4 


giving a clear energy difference between vector and scalar mesons. Using 
My = Ma = 310 MeV and m, = 483 MeV (16.34) 


we can fit all vector and scalar masses to the formula for Mmeson above to 
within 1% provided 


1/8 2ma V^ 
5 (F) ara. Umes (0)? = ( T ) x 160 MeV (16.35) 


which is a huge improvement! 


TABLE 16.2 
Quark-Model Meson Masses 


Calculated Observed 


T 140 138 
K 484 496 
n 559 549 
p 780 776 
w 780 783 

K* 896 892 
$ 1032 1020 


Note that the masses of the up, down, and strange quarks differ considerably 
from those given in table 16.1. This is because the masses in table 16.1 are the 
current quark masses: the mass of a quark “by itself.” However, a bound quark 
is continually exchanging gluons (the force carriers of the strong interaction) 
with its counterparts in the bound state, which adds an appreciable amount 
to the total energy of the baryon or meson. It is convenient to define the 
constituent quark mass: the current quark mass plus the mass of the gluon 
particle field surrounding the quark. So for the strange quark its current 
mass is 104 MeV (a modern refinement over the original crude estimate of 
150 MeV), whereas its constituent mass is 483 MeV. Similar considerations 
apply for the up and down quark masses. 

Clearly we're on the right track. For baryons, the spin product is a bit 
trickier. If all quark masses are equal (as in the N, A, and Q) then 


A ed 
5 Sq, + $4, |EMB) 
i<j Maq, Mag 
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MU cq eee ETE 
= [Si - S2 + S e S, 5) IEMB) 
q 


1 > > > > > > > = 
[Sbaryon * Sharyon — $1 + $1 — 52+ S> — Sa 8| JEMB) 
q 
n E d vH 
Na =3 (decuplet 
A a eo EMB) (16.36) 
m2 | —% for Sharyon = 3 (octet) 


where EMB means “equal mass baryon,” giving 


3h? (3270, 2 

My = 3m = 3 ( = ) |Ybaryon(0)| (16.37) 
3n? (321A, 2 

Ma = 3m irs ( = ) [Wbaryon(0)| (16.38) 
3h? (3270, 2 

Ma = 3m, 175 ( 5 ) | baryon(9)| (16.39) 


Actually, for the decuplet all spins are parallel (every pair combines to make 
spin-1) so 
1(1+1)h? |decuplet) = ( Sy + S, 2 |decuplet) 


( 
= (&- E M EE I» [decuplet) 


= ¿e + wh? +25), S, J |decuplet) 
> Se : Sy i Ue for the decuplet (16.40) 
which gives 
My. = 2m, + m, E (zs =) (=) |W aryon(0)(16.41) 
Mz» = 2m, + m, 4 € =m) (=) |W aryon(0)((16.42) 
for the remaining decuplet states. For the X and A octet states, the up and 


down quarks are respectively in isospin 1 and isospin 0 states, which means 
that their spins must respectively combine to give 1 and 0. Hence 


Me BND 
1(1 + 1)h? Joctet- X) = (8. + Sa) loctet- 2) 


= E + "s +25 54) loctet- 5) 
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d= "d for the octet X (16.43) 
332 
( ut Sa) loctet-A) 
3,2 | 3,2 God 
= qh + qh + 28, : Sa loctet-A) 
> S, . S, = -žy for the octet A (16.44) 
and so 


My = 2m4, + Ms + 


(E : Sase is ai 


MuMa MuMs 
32777 a5 2 
x ( 18 ) baron (0)| 
H? (1 4 32770, 2 
= 2m, + ms + € — ) ( 5 ) [arson (16.45) 


| 2 


(16.46) 


3h? (32120, 
Ma = 2m, + m; ( ) Vpoaryon(0) 


4m2 18 


Finally, the mass of the © is computed similarly to that of the X, and so 


B1 4 32120, 2 
Mz —2m, + m, + ( ) ( s ) |baryon(0)] (16.47) 


4 (M? mums 18 


and so we have predictions for the masses of all states in the octet and de- 
cuplet. Using the quark masses we used for the meson case, we find that 
setting 


32770, 2 2m, MV? 
( "s ) Mbaryon(0) -( s ) x 50 MeV (16.48) 


yields agreement with experimental data to within 1%, as you can see from 
table 16.3! 

The actual mass of a hadron can be can be understood as consisting of two 
components: (a) the current mass Meyrrent from the quarks themselves and 
(b) a contribution Am from the electric charge of the hadron (due to the work 
required to put a charge on the particle), so that m = Meurrent + Am. If all 
baryons in an octet have similar charge distributions, then we expect 


Amy = Amy+ Amy- = Amz- Amzo = Amn (16.49) 
Summing these equations and adding the bare masses to each side gives 


Mp + myy- + Meo = My+ + mz- T Mn (16.50) 


Mp — My = My+ — My- + Mg- — mzo (16.51) 
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TABLE 16.3 
Quark-Model Baryon Masses 


Calculated | Observed 


N 939 939 

A 1116 1114 

3 octst 1179 1193 
= 1327 1318 

A 1239 1232 

Y accoplet 1381 1384 
= 1529 1533 

Q 1682 1672 


which is a formula due originally to Coleman and Glashow [154]. Empirically 
we have 


my — m, = —1.3 MeV (16.52) 
me — my- + msz- — mz = —8.0 MeV + 6.4 MeV 
= —1.6 MeV (16.53) 


providing good confirmation that up and down quark masses are nearly iden- 
tical. 


16.4 Magnetic Moments 


The magnetic moment of a baryon should be the sum of its quark magnetic 
moments: 

Hraryon = H1 + Ha + Hs (16.54) 
If the quarks are pointlike, then 


lanar 2 EN NEL Couarkh _, 
mo Sua = ne = ric (16.55) 
2Manark C MauarkC QM guarkC 


ra — 
Pquark = quark 


where gauak = 2 to a high-degree of approximation (i.e. neglecting its anoma- 
lous magnetic moment). The magnitude of a baryon magnetic moment is 
defined to be the expectation value of the magnetic moment operator of a 
spin-up baryon states: 


|fivaryon| = Hp = (baryon f| (Eia + la + iia); |baryon 1) (16.56) 
where 
eji 


2mjc 
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_2 eh _ 1l eh EE E 
x xm 3 2muc Hd 3 2mqc m 3 2mgc 
(retaining the factors of 7; and c), and where g, | = +I; ¢|1) 2 —-|D . 


Let's compute this for the proton. Recall that the quark model predicts its 
wavefunction is given by eq. (16.9) 


pt) = 5 (2lutdlu=luldtun—=HTulun 


21d utut) *2luTutdl)-lutuldT) 
-htdtul)-Mtutul)-lutetdt)) (16.58) 


and we must compute (p | (ui + A2 + fiz), |p T) term-by-term. For example 


(ity + fla + ñs), lu 1 d Lu 1) = (uu + (ED) ta ua) lu 1 d Lu 1) 
= (2n. — pa) lu Y d | wT) (16.59) 


and so the first term contributes 


ae SS 2 
(m) (utd |u| (fi + Ha + fis), ut dl ut) = 9 x uu — na) (16.60) 


The second term contributes 


2 
(=) (u Ld Tut Gh + f+ fs), lu Ld tut) = sma (16.61) 
2 


and it is easy to see that all other terms contribute either $ x (2j — Ha) or 
5 Ha to the magnetic moment. Noting that all states in the proton wavefunc- 
tion are orthogonal, we obtain [153] from a term-by-term calculation 


1 
= = (4Mu — 16.62 
> = 5 (4u =a) (16.62) 


oreson = [3% E (Que — pa) +6 x xc (Pa) 
We can repeat this for all the octet states, and compare to experiment. As 
table 16.4 shows, agreement is good, but not outstanding. Clearly the quark 
model will have to be modified (by including more detailed interactions be- 
tween the quarks) if we are to improve on these predictions. 

The X? state has a very short lifetime of only (7.4 + .07) x 107°? seconds (as 
compared to lifetimes of 10^ !? seconds or longer for all of the other spin-1/2 
baryons). Hence the magnetic moment that can be measured is a transition 
magnetic moment associated with its decay into a A particle. This quantity 
is not really comparable to the magnetic moments of the other octet baryons 
and so I have not filled in a value for it in table 16.4. 
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TABLE 16.4 
Quark-Model Magnetic Moments 
Formula Calculated Observed 

p 1 (Au, — pa) 2.79 2.793 
n 1 (Apta — Hu) —1.86 —1.913 
A Ls —0.613 —0.613 + 0.004 
xt S Hu — ils 2.68 2.458 + 0.010 
X? 2 (ua + Hu) — ius 0.82 (see note below) 
xs == elle —1.05 —1.160 + 0.025 
z0 ils — zu —1.44 —1.250 + 0.014 
24 als — glad —0.51 —0.651 + 0.003 


(all quantities in units of nuclear magnetons 
UN = n ; = 3.152 x 1071? MeV /Gauss) 


16.5 Questions 


1. A well-known theorist at the University of Waterloo tells you that he has 
been able to solve the three-body problem in quantum mechanics. As 
expected, a fully symmetric spatial wavefunction describes the ground 
state. A fully antisymmetric spatial wavefunction describes the first 
excited state. Assuming this is correct, how many baryon states does 
the quark model predict at the first excited level? 


2. Verify the table 


Magnetic Moments of Baryons 


Formula 

p 3 (bu — Ha) 

n 3 (Apa TAS Hu) 

A Hs 

Et SHu m: 3Ms 

x 2 (ua + Mu) — ius 
AC jd Hs 

E gis — 3LHu 

a 3 Hs — 3 Hd 


3. Consider the spin-spin potential 


K a ut 
= 01020 (712) 
m1ma 
between a quark and an antiquark of masses m1, M2, separated by a 
distance 712 where K is a dimensionless constant in units where c = h = 
1. 
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(a) Calculate  |W (0)|^ from experiment for the cz system, in units 
where c and 7i are not set to unity. 


(b) Compute the splitting of the 3S4 and +So states for the Y system, 
which is a bound state of a b-quark with its antiparticle. 


(c) Does this work for the light mesons (e.g. for the p-r splitting)? 
Why or why not? 


Calculate the masses of (a) the J/q (a bound state of a charm quark with 
its antiparticle) and the D-mesons (bound states of a charm quark with 
antiquarks of lighter mass) and (b) the B-mesons and the Y. Compare 
your results to experiment, and comment on the implications for the 
quark model. 


Suppose that there were only two colors, say red and blue, with the color 
symmetry being given by SUc(2). Baryons would consist of colorless 
bound states of quarks, whereas mesons would consist of colorless bound 
states of a quark and an antiquark. 

(a) Write down the color wavefunction for a baryon and the color wave- 
function for a meson. 


(b) Are baryons fermions or bosons? Are mesons fermions or bosons? 
(c) How do the lowest-energy baryon and meson states differ? 

Suppose that there were N colors, with the color symmetry being given 
by SUG(N). Baryons would consist of colorless bound states of quarks, 
whereas mesons would consist of colorless bound states of a quark and 


an antiquark. Write down the color wavefunction for a baryon and the 
color wavefunction for a meson. 


Are any of these processes allowed? Why or why not? 


a) P — dem (b) n — 37 
(c) o? — «? +7 (d) pf — rt +r +r 
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Testing the Quark Model 
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Can we be confident that quarks actually exist? Or is the quark model noth- 
ing more than a clever classification scheme for mesons and baryons? This 
issue came to dominate the scientific community in the late 1960s, and many 
remained skeptical of the existence of quarks. This began to change in 1968 
when experimental evidence was found that protons had constituents called 
PARTONS. For a period of time the theoretical and experimental situation 
was in a state of confusion. However, after a few years partons came to be 
identified with quarks, and a series of experiments were carried out that pro- 
vided the best “direct” evidence for the existence of quarks that we have. 
Today quarks are an established part of the Standard Model. 

Quarks have never been isolated as free particles, so it is generally thought 
that they are always confined to hadronic bound states. If this is the case, 
how could we possibly observe them? The best way is with electromagnetism. 
Everything in QED that applies to electrons also applies to quarks, provided 
the appropriate charge of ze or -łe is used. Since quantum electrodynam- 
ics is well understood, electromagnetic interactions furnish a useful probe of 
hadron substructure — if quarks exist, they should be able to interact electro- 
magnetically with electrons, after all. 

There are three basic processes that give us experimental information about 
quarks: vector meson decay [155], hadronic production, and deep inelastic 
scattering. The first two of these are (roughly speaking) time-reversals of 
each other, and are shown in fig. 17.1. 


17.1 Vector-Meson Decay 


Consider the decay of a vector meson (i.e. $, p or w) into a lepton-antilepton 
pair (i.e. por e). The diagram at the left in figure 17.1 gives, after doing the 
ó-function integrals 


i = vg 2 —te : H Un. 1 o Iw 4 » / —ie v Vp / 
iM = Ta, (pa) (Lies 7") t, n) Te (P3) (ier”) ves (p1) 
= ME [m (p2) (YF) us; (m)] fue- (p3) (Yu) ve (pi)] (17.1) 
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We could simply square this and put it into a cross-section formula if the 
initial quark and antiquark were free particles. However, they are not free 
particles — they are bound together into a meson. The total quark momentum 
is therefore (pı + p3)? = M$,the mass of the vector meson, and this has been 
put into the formula (17.1). In fact the meson is described by the bound state 
wavefunction Vmeson = ® (space) y (spin) x (flavor) y (color). We know that 
the color wavefunction gp (color) is guaranteed to be neutral (i.e. a singlet) 
since it is made from color-anticolor pairs and that 9 (space) should be a 2- 
Body ground state wavefunction. The y (spin) is accounted for by the Dirac 
wavefunctions above, but we need the proper 4 (spin) x (flavor) combination. 
So the matrix element should read 


n 
la; 
E Ai 2 2 X(Yöq, (pa) (4!) us (p1) | Fe- (05) (01) ve (21) 
(17.2) 
where we sum over the n flavors in the meson and a normalization factor of 
TAT has been inserted. 

To get a decay rate we need to compute the spin-averaged matrix-element 
and integrate over the spatial distribution of the quark-antiquark combina- 
tion. Since we know that the quark and antiquark will only annihilate each 
other when they are at the same spatial position, this integration must give 
something proportional to the square of the wavefunction at the origin. Using 
the Casimir trick and taking traces* 


XL [Pon toos] Peo") wen 


e (+m) ml 
= 4 (pipi + pip — g"" [má + pı - Dol) x 
= LY" (py, pa; —m;)Ójk (17.3) 


where 
LH” (pi, po; ^m?) = 4 (pf p5 + ptpb + g"" [-m? — p - p2]) (17.4) 


and where the 6;,; appears because we get zero unless the two flavors of quark 
in the meson are the same. This yields for the square of the spin-averaged 
matrix-element 

2 


aM |® (0 Po x(j) LEY (p1, poi — mi) Luv (pA, pa; — m?) 


(17.5) 


|? 2 


|M 


*This procedure is exactly the same as the one we carried out in Chapter 13, so I will not 
repeat any details here. 
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where |y (color)|? yields an overall factor which is the same for all mesons. 
Since this expression is in an arbitrary frame, we can write as a non-relativistic 
approximation 
Ma, Ma, 
po n poll GG 

Pi = My av PET M,” (17.6) 
where we ignore the relative momentum between the quark and the antiquark. 
Since mg, = Mg, this gives 


m2. 
LY” (pi, pa; m3) — 4 E Py — g^" Bi] 


Mi 
H uv 
PyP 
ae —8m?. ( Av V £) 
95 Mg 
H Y 
v PyP 
= —2M2 c = ert) (17.7) 


since Mg, = iMy — the mass of the vector meson is twice the quark mass! 
For the leptons emitted in the process, we have p“ = (E, p) and p¥ = (E, —p) 
in the rest-frame of the meson, where |p| ~ E neglecting the mass m of the 
leptons. 

Hence we get 


LH” (py, pa; =m?) Lur (py. po; =m’) 


pi:pv)(p»: p p 
-aMg att pb) - A +20) a0 Pv) a PY) 5 Gay q) 2 


E MZ j 1'P2 MZ 
(EMv) | MZ 

= -8Mg E (2E?) - 2 me? (2E?) MZ 

= 8M} (17.8) 


since E = iMy by energy conservation. Hence the matrix element becomes 


2 
n 


NE (s RE 
[Mo qr e OP [5 5 7 xG) (17.9) 
j=l 


where the constant of proportionality is related to a color factor that is the 
same for each meson. 


The two body decay rate Dz s,4, = aig Mi becomes 
2 
pupas a? a % €q; 
Be St) aa XO) (17.10) 
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and will be multiplied by a function of the momenta of the initial and final 
states, but (apart from the factor of M. E, ) is common to all the vector mesons. 
Using 


2 1 
eu = 3° and eg =e, = 2 (17.11) 
along with the flavor wavefunctions for the vector mesons, we find 
1 = 
xe va (lva) — |dd)) 
3 2 
A rua €g 12^ 1 1 1 
=> “i = = 17.12 
x] o) an» 
j=1 po 
2 
t eq, Du 1 
x (#) =188) > |x) —|l73)| =5 (17.13) 
j=l 
de nct 
xlo) = Je (uz) + |e) 
d 2 
2 n 12 1 1 1 
=> j| = = 17.14 
Yon tuos) m un 


yielding 


T (99 — OO) :T (p — Oe): Twoere) 


2 
=J 9x2 (2) yl predicted (17.15) 
8.8 + 2.6 : 1.70 +0.41 : 1 observed 


where you might recall that the p and w have similar masses. This corrobo- 
rates the flavor assignments for the ¢ and the w we made in Chapter 16, and 
is a test both of these assignments and of the fractional charges of quarks. 
However, we can provide better evidence for the existence of quarks by looking 
at this process in reverse, as we consider in the next section. 


17.2 Hadron Production 


When ete” — qq (via a photon) the quark and antiquark will fly apart 
as free particles until they reach a separation of about a fermi (107% m). 
At this separation the strong interaction is so powerful that new quarks and 
antiquarks are produced, which in turn will bind into hadrons. Dozens of 
hadrons are formed, and the resultant state is a mess. However, momentum 
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£t EB 


Vector meson 


FIGURE 17.1 
Diagrams for vector-meson decay (left) and Hadronic production (right). 


conservation will force them (in the CMS frame) to emerge in two cone-like 
streams called jets — each jet tracks the primordial quark or antiquark from 
which it arose. 

The cross-section for this process will be just like that for ete” — u*u- 
except that we must replace the charge of the muon (antimuon) with that of 
the quark (antiquark). The diagrammatic rules give from fig. 17.1 


iM = [ug(p5) (—teg7") vg(p1)] SM z [Ber (p3) (—iey”) ue-(p1)] (17.16) 
(pi + p2) 


which upon spin-averaging and spin summing give as before 


1 2 
T1142 
IM = 5 


€€q 


——=3 3 | L” (p py; m2) Lys (pi, po; ^m?) (17.17) 
(pı + pa) 


The dot products work out to give 


2 


——2 ee 
|M| =8 a ((pi * p1) (ps - pa) + (mi : pa) (p5 - pı) 
1 2 
+m? (py : p1) + ma (pa: p1) + 2m?m2) 
m? m2 m2 m2 
= (ee? |1 + s exe un 1 E cos? 0 | (17.18) 
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where the last line is in the CM-frame, with 


p =- =p, |P= vE- m? (17.19) 
El 


Now write eg = Qe, where Q is either 2/3 or —1/3. The differential 2-Body 
cross-section is 


do (heV WI 191 
dQ. 8r) AE? |p| 
_ (he)? @ (Ana)? [51 
XA 8 8E? |p| 
2 m? 2 m? 
eE (1 zs) (: Best (17.20) 
or, integrating over the solid angle dQ = sin 0d0d6, 


aes qan) lic? Q? (tray E? — m 
i 8m 8bE?  J/E2— m2 


I 


2 m? m? 
2 bog n. x). > ( es) E ze || (17.21) 
which can be simplified to 

7 ( Qfhica y1 m? m? 

= 1 1+ —> 

" =) a Elm) (e 

E? 
h 

= E (Sm) for E > m, mq (17.22) 


Notice that for E < m, the square root is imaginary. This means that 
there is not enough energy to create the quark-antiquark pair and the process 
is kinematically forbidden. 

As the beam energy E increases we will encounter a succession of thresholds: 
first the muon, then the light quarks (u, d, s); later (at 1500 MeV) the charm, 
the bottom (4700 MeV) and eventually the top quark (171,000 MeV). So we 
expect (for E > m, mq) 


s m¿<E 


| a (ete — hadrons) , 
E a (ete- —> utu =) = Y), `X Qavor = 3 5 Qj (17.23) 
j=1 


colors flavors 
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which gives (neglecting corrections due to mass) 


942 1\2 1\2 
R=3x (2) + (-) + (-3) | =2 above the u, d, s threshold 
DN 10 
=3x G +2 = 3 above the c threshold 
e 10 1 
=3x = above the b threshold 
3 3 3 
ON FT 11 
=3x G) + 375 5 above the t threshold (17.24) 


There is one correction that needs to be included: the appearance of the 7 
lepton! (1784 MeV), whose discovery by Martin Perl and collaborators at 
SLAC in 1975 [157] came as a surprise as experimentalists checked the value 
of R as a function of energy. Once its presence was understood the prediction 
for R was found to be in pretty good agreement with the data (see fig. 17.2), 
especially at large E. The color factor 3 clearly needs to be there, providing 
experimental evidence for color! Of course we have no experimental data 
for the top since there is no ete~ collider that can reach the requisite energy 
threshold. 

Why isn't the agreement better? First, there is a correction from QCD that 
corresponds to the radiation of a gluon, and shows up in the final state as a 
3-jet event. The process yields 


ma«E 2 
R=3Y Q (: ES 22 (17.25) 
j=l 

at a center of mass-energy E, where a, (E?) is the running coupling constant 
of QCD, analogous to the running value for a in QED. This gives a correction 
of about 5%. Furthermore, in calculating the above process, we treated the 
quarks in the final state as free (i.e. non-interacting) particles. However, 
this isn't correct — they are actually virtual particles that are on their way to 
becoming hadrons. When the energy is at a value that is right for forming 
a bound state, the interactions between the quarks dominate, leading to the 
spikes in the above graph. Here our simple approximation fails badly, but 
the actual QCD calculation (the solid curve in figure 17.2) does an excellent 
job of matching the curve, including the resonance peaks. The flat regions of 
the graph show that our approach using eq. (17.24) is reasonably good for 
energies that are not nearby any resonance. 


+The name comes from the Greek Tpurov, or triton, meaning “third”. 
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FIGURE 17.2 

World. co ou dhe vg cross section of e* e^ — hadrons and on the ratio 
Rz cpm ue) 'The broken curve is the naive quark-parton model 
prediction (17.24), and the solid curve is the 3-loop QCD prediction [156]. 
The Breit- Wigner parameterizations of the J/y, (25), and the T(nS), (n = 
1,2,3,4) are also shown. Image courtesy of the Particle Data Group [1]. 
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17.3 Elastic Scattering of Electrons and Protons 


At modest energies electron-proton scattering is elastic: e” + p — e +p. 
Based on our work in QED we might expect the spin-averaged /summed matrix 
element to be similar to what we got for the muon, which is 


2 
= Le” (py, p1; m?) Lu (p, po; M?) (17.26) 


where momentum conservation implies q? = (p — pi)’, and Luv is given by 
our formula above. However, this assumes that the proton (assumed to have 
mass M) is a structureless point particle, just like the muon. 

This is not right — the proton has structure. For example, we saw in Chapter 
16 that its magnetic moment is 2.79 nuclear magnetons, vastly different from 
a pointlike spin-4 particle. Consequently we shouldn’t assume that photons 
interact with it in the same way that they would with point-like objects such 
as muons and electrons. The actual lowest-order diagram for this process 
looks something like what is in 17.3, where the blob denotes our ignorance as 
to how the photon and the proton interact. 
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FIGURE 17.3 
Scattering of an electron (thin line) off of a proton (thick line). 


Except for this blob, the diagram is unchanged: in particular, the photon 
propagator and photon-electron vertex is unchanged, and so we expect 


y APN 
IM| => (5) LH” (py, py; m?) K (pa, pa; M?) (17.27) 
where K,, is an unknown quantity describing the photon-proton vertex (or, 
more precisely, what you get from squaring this vertex). 


17.3.1 The Photon-Proton Vertex 


We don't have any diagrammatic rules to tell us how to compute K,,. How- 
ever, we can make use of some basic physics that we know must be true for 
the proton: namely that it respects Lorentz invariance and electromagnetic 
gauge symmetry. This means that K,,, must obey the following criteria. 


1. Kj, depends only only two 4-vectors, py" and p$, or alternatively q” = 
ps — ph = pt — pt and pj . Since K,, has two Lorentz indices, it 
can depend only on quadratic products of these two 4-vectors or on the 
metric, each multiplied by a different scalar function of the momenta. 


2. Since L*” is symmetric, Kyy must also be symmetric. Hence, in combi- 
nation with the previous criterion, its most general form is 


Ko K3 Ka 
Kyu = K19uv | 2 Pen Pav | M2 QuQv d M2 (duD2» + qyPay) (17.28) 


3. The functions Ky, Ko, K3, and K4 must be scalar functions of q^ 
and p5, and so depend only on q?, q p2 and pł. From momentum 
conservation we have 


1 
p;— M* and q.po— —5d (17.29) 
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which means that the K; are functions of q? only. 
4. Gauge invariance implies conservation of the electron current pyty, or 
9, (y^) = 0 = q, (Vy) =0 
=> q, L” (p,,pi; m?) 20 (17.30) 


where this follows upon Fourier-transformation [97]. Eq. (17.30) can be 
easily checked and in turn implies that without loss of generality 


qu K"" (p3, pa; M?) — 0 (17.31) 
whose derivation is not obvious; I’ve left it as an exercise. This finally 
yields 

K K 

-Kiq + aua (q: pz) Pav + ad & 
K4 

F (dpa + (q: p2) q) =0 (17.32) 

M? 1 1 
K3= a K; + ¿Ka and K4 = he (17.33) 

so that 
2 
= 2 Qudv Ko (q ) 1 1 
Kyu " Ki (q ) ( q a ow) + M2 P2p 1 9 du Pov + 3dv 

(17.34) 


17.3.2 The Rosenbluth Formula 


It is up to the experimentalist to determine the specific forms of the K’s and 
to the theorist to find an explanation in terms of proton structure for these 
forms. Inserting this into the matrix element above gives 


——2 e? 2 
|M] => (5) LH” (p, pi; m?) 


2 
c Kə (q ) | 1 1 
= ow) G M? P2y 1 9 du P2 + at 


Ape 
(5) & (pi pt + ppt — gt” [-m? +p- p1))] 
i 


Q 


Kı (q Juv + E papa (17.35) 


which simplifies to 


fg) Leia (maga 
(17.36) 


Testing the Quark Model 319 


In the lab frame we have 
p3 =(M,0) pi=(E,9) pË =(E',p') (17.37) 


and so eq. (17.36) becomes 


IM] 24 E | (r | 7) Kı (4?) + Ko (4?) c B: 7) (17.38) 


q 


where q? = 2 (m? — EE’ + |p'"||p1cos8) = 4EE sin? $ +0 (er) < 0. 


At energies E, E/ > m, the differential cross-section for e” + p > e- +p 
in the lab frame is 


d h 2 25,2 
a) = um (17.39) 
a 8r) M |p| [9] CE + Mc?) — |p| E' cos 6] 
2 
a E' 0 0 
= 2K, (q?) sin? — + Ko (q? 20 
(a) (=) | 1 (97) sin z t 2 (q?) cos 5 


which is called the Rosenbluth formula [158]. Note that the kinematics of the 
situation imply that q - pa = -iq,, Or 


E = —____ (17.40) 


at energies E, E/ > m. This means that the Rosenbluth formula has only 
one independent parameter: the scattering angle 0. 

By counting the number of electrons scattered in a given direction for a 
range of incident energies we can determine K;(g?) and Ko (q?). However, 
it's customary to define 


2 2 q? 2 
Kı (4?) = nid. (d) and Ko (q?) = Gr (4) + aie Cm (4%) (17.41) 


+ a 


where Gg (q?) and Gu (q?) are respectively called the electric and magnetic 
form factors for the proton [159]. A comparison of the measured differential 
cross-section (43). TN with the Rosenbluth formula above (17.39) then 
allows a measurement of the proton form factors. This provides us with 
information as to the structure of the proton. 

How can we obtain direct information about the quarks inside the proton? 
This is the subject of the next section. 
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17.4 Deep Inelastic Scattering 


Rutherford's experiment scattering alpha particles off of gold atoms demon- 
strated that the charge of an atom is concentrated in its nucleus. It pro- 
vided firm evidence that the Thomson model - in which positive and negative 
charges were uniformly distributed throughout the atom — was not correct, 
paving the way for the Bohr model and, ultimately, our current understand- 
ing of the atom. The inelastic scattering of electrons off of protons has had a 
similar impact in particle physics. 

In an inelastic scattering process, the particles in the final state are not the 
same as in the initial state. In the context of electron-proton scattering, we 
now have e” + p — e + junk. The diagram is now given by figure 17.4, 
where the “junk” represents all possible particles that can come out: pions, 
Kaons, A's, =’s — you name it! The main distinction between this case and 


X-junk 


FIGURE 17.4 
Deep inelastic scattering of electrons off of protons. 


the previous elastic scattering case is that the final state momentum 
py = (P3 H + ph)” (17.42) 

is no longer the square of the proton mass, i.e. 
l 


p? 7M? =>q:p%-54 


> (17.43) 


Except for this feature, the rest of the process is the same: in particular, 
the photon propagator and photon-electron vertex is unchanged. Hence we 
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expect for a given final state X,, (containing n particles) in the process 

—— 2 1/e : pn 2 

|M (Xs) = = zZ L” (p,p; m?) Ky, (X4) (17.44) 


which will give a scattering cross-section 


nS 


do, = 


2 
44/ (pı : p2) — pips 


x (2r) 6 (x + NU = p2 — n) (17.45) 


1=3 


for the state Xn. 

It would be extremely difficult to compute this for a particular but arbitrary 
final state. Fortunately this is not necessary! Experiments typically only 
measure the momentum pj of the outgoing electron. Hence all we need to 
do is integrate over all possible final states X = (X1, X2,... Xn} that could 
occur. This gives 


TM en” c (d*p| y 
do = e (5) E Hyr (pi, pi; m*)Ww (pa, pa; M?) 
(pi: pa)” — pipa I 


(17.46) 
where now W,» is an unknown quantity 


W,v(p5, po; M?) = zur »» Ws pi) J Ku(Xn) 


x (2n)* 5 (x + > -pi— J | (17.47) 


1=3 


describing the photon-proton vertex (or, more precisely, what you get from 
squaring this vertex). We call this the inclusive matrix element (and the 
resultant cross-section the inclusive cross-section) because W,,, includes all 
accessible final states X. 

The energies that occur in the experiment are typically much larger than 
the mass of the electron. If the mass M proton is at rest, then 


(pi: po)” — rip = MV E? — m? ~ ME (17.48) 


where E is the initial electron energy. We also have d?p, = |p1 | d |p| dQ ~ 
+ 2 / 
(E dE'dQ, where E is the energy of the outgoing electron (if the electron 
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has enough energy to blast apart the proton, then its mass is negligible). 
Hence 


x (lie? V? E (dE'dQ) ,,,, 
do =$ (=) TUS ) pe (pipi; m?)W,,(po,po; M?) —— (17.49) 
or, more compactly, 
do ha N? El 2 
a (=) i (17.50) 


since e? = 4ra and I have inserted the relevant factors of / and c in the 
cross-section. 

As with K,,, we can make use of some basic physics, namely Lorentz 
invariance and electromagnetic gauge symmetry. This means that W,,, obeys 
the same criteria that K,,, did, and a similar argument gives 


E WW p Ma q:p2 q'po 
Wi = WA ( P ow) 1 M2 c g 2 c = DP) (17.51) 


which is just like the expression we had before for K,, except that now 
1,2 
q:paf —-34- 
Inserting the form for W,,, into the matrix element above gives? 


2 
L'"W,,-—8 |- G + 2) WA (4, q: pa) 


+W (g?,q - pz) pru i 7) (17.52) 


which in the lab frame (with E, E' >> m) 
py =(M,0) p=(Ep př = (EP) (17.53) 
0 
q? =2(m? — EE’ + |p '| |p| cos0) ~ —4 EE" sin? D (17.54) 
becomes 


6 0 
L'""W,, = AEE' |2WA (q?, q: p2) sin? = + Wa(qd?,q:p2) cos? 2| (17.55) 
x 2 2 


Hence at energies E, E' >> m, the inclusive differential cross-section in the 
lab frame is 


2 
do E ha 2 . 90 5 20 
dE'dQ (z) am, (q ‚q: p2) sin z t We (q , q- p) cos 5 


(17.56) 


2 


Recall that L"q, = 0 and that p] : pı = m? — S 
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This equation is the inelastic generalization of the Rosenbluth formula (17.39): 
there are still two structure functions but now they depend on two variables 


(E' and 0), since we no longer have q+ p; = — 40°, or E' 4 oe ae: 
M 2 
It's customary to write 
2 
$m (17.57) 
2q : p2 


so that the W;'s are functions of q? and z, and we have 


2 
do ha 0 0 
= 2W, (°, £) sin? = + Wa (g? =| (17.58 
dE'dQ (es ;) | xr c gene a ERR) 
The variable x is bounded by 0 < x < 1, with the elastic case the limit 
x=1. In this limit, we should be able to recover the Rosenbluth formula by 
an appropriate choice of the W;'s. After all, elastic scattering is a special case 
of inelastic scattering in which (p5)? = M?. It’s not too hard to show that 


Kia (q?) 
2M q? 


is the choice that recovers the elastic Rosenbluth formula once you integrate 
over &. 

Note that so far the only physics used in obtaining the differential cross- 
section is our assumption of Lorentz invariance and gauge invariance. In 
order to proceed further we need to make some hypothesis about the W;'s to 
construct specific models of the proton that in turn will allow us to predict 
the cross-section. For example, suppose that the proton were a simple point 
charge of spin-7. We would then have 


Wi (°, £) = 6 (a — 1) (17.59) 


2M 
Wi (d,x)- ——6(r—1) We (¢?,2) = ==0 (1) (17.60) 
At low energies — where elastic scattering occurs — this is not too bad a model. 
However, we know it is not fully correct — at high energies the proton scatters 
inelastically, and so must have some internal structure. 


17.5 Quark Model Predictions 


What would a quark model predict? Well, one thing that should happen 

is that at sufficiently high energies the virtual photon interacts with a sin- 

gle pointlike quark that is instantaneously free. This means that the quark 
structure functions should be like those for point particles as in eq. (17.59): 
; 2 ; 2m;Q? 

Wi (qmi) KTO -1 Wi (4,2) = 2S n —1) (17.61) 
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where m; is the mass of the jth quark inside the proton, and 


2 
= q €j 
jx = 17.62 
tj = TS Qj a (17.62) 


where p; is its 4- momentum. Note that the structure functions are propor- 
tional to the quark charge because the photon couples to the individual quarks 
by a factor proportional to quark charge. 

So we could compute the cross-section dg if we knew the momentum p; 
of each quark in the proton. We don't know this, because the momentum of 
the quark inside the proton is the sum of the momentum with respect to the 
proton's center of mass plus the overall momentum in the lab. Let's assume 
that the motion with respect to the proton's center of mass is unimportant: 


p; = zjp» + negligible corrections (17.63) 


where the total momentum p% of the proton is 
pow (17.64) 


Essentially this assumes that each component of quark momentum gets the 
same fraction of proton momentum - the “negligible” corrections are those 
that include the motion of the quarks relative to the proton's center of mass. 
Using this assumption we now have 


q x 
t; =— — — and m;=2;M 
j 2q- pj zj j J 
, OP fy 2 
1 Can = — ~-1)= ~s (t-z 17. 
Wi (°, zj) 77M 5 yr zj) (17.65) 
2z;MQ? fx 222MQ? 
W3 (9,1) = p Je 1)= q? *4 (a — zy) 


'To get the structure functions of the proton in this model, we must multiply 
the (pointlike) quark structure functions W} by the probabilities f; (z;) that 
the jth quark has momentum 2;p5, and then integrate over all quarks in the 
proton: 


= VQ (2) (17.66) 


Testing the Quark Model 325 


ANDE 
Wa (4,2) = x dz; f; (25) W3 (4?, 25) 


2? MQ? 


b dz; f; (2j) ) — L9 (2-4) 


2M 
= Ca? Qi, (1) (17.67) 
q J= 
or alternatively 
1 3 
MW, (@, x) = E Fi) (17.68) 
_PWe ( eg x) 3 
po ES 17. 
e dels Fa(x) (17.69) 


a remarkable result known as Bjorken Scaling [160]: the structure functions 
(appropriately multiplied) are independent of q?! 


We also see that 


F(x) = 220 F; (x) (17.70) 


which is called the Callen-Gross relation [161]. Hence the prediction of the 
quark-parton model is that the inelastic cross-section is 


2 
do ha 0 0 
= 2WA (q: in? — + Wo (92, q - AE 
dE'dQ (zer) | ı (q d p2) sin z t 2 (q ,Q p2) cos 5 


2 
Fix) ha 2EE' 29 
= | l 71 
2M ES | (E- E) 3 (1771) 


The importance of these relations is that they provide a clear empirical 
signature for the quark model. If protons are indeed made of constituent 
pointlike objects, then Bjorken scaling will hold. If the constituent objects 
are spin- 5, then the Callen-Gross relation holds (the function F; (1) = 0 if the 
quarks have spin-0). You can see how these predictions fare against the data 
in figure 17.5. They do quite well [162], and after the deep inelastic scattering 
experiments [163] were performed in the early 1970s physicists accepted the 
existence of quarks. 
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FIGURE 17.5 

A plot of F as a function of the energy transfer q?, using the data from deep 
inelastic scattering experiments [162]. Note that it has the same value at 
various scattering angles for all possible q?. 


17.6 Quark Structure Functions 


To finish the job, we need to compute PFi(x). This is not that easy to do 
because we have to make assumptions about how the quark momentum is 
distributed inside the proton, assumptions that go beyond the naive delta- 
function relations given above. If we really believe that m; — z; M, then 


f; (a) = 8 (53 - 25) (17.72) 


since the momentum fraction of each quark is fixed. This would give for a 
proton 


a) C3) G9) 
=z (e) (17.73) 


where I have set the mass of the up and down quarks equal. This would make 
Fi(x) a sharply peaked function, which experiment shows not to be the case. 
More generally, suppose that 


fu (a) = Sula) gud Posa (17.74) 
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where u(r) is the probability density that momentum fraction x is carried by 
an up quark, and d(x) is the analogous probability density for a down quark. 
The factor of 1/2 ensures that u(x) is the probability density for finding either 
up quark. This gives 


ta = 3 BES ( ;) «e| (17.75) 


The naive model would give u(x) = 26 (Ze — x) and d(x) = 6 (44 — x). We 
want to improve on this. 

One reasonable consideration in determining u(x) and d(x) is to require 
that (a) the average momentum carried by up quarks is twice that carried by 
down quarks (since the proton has twice as many ups as downs) and (b) that 
the masses of up and down are about the same. That is 


(Pup) — = 2D down) > NE dx = 2 f (ap2d(x))dax 


s me yar =2 f Gade (17.76) 


a relationship that is supported by data from electron-neutron scattering. 
This implies 


[sones fa [E me Cy ve nm one or 


Experiment indicates that the quark structure functions are 


1 1 
i| a oía f aaru (17.78) 
0 0 


which has the rather curious implication that 
1 
(P5)all quarks = / (zpzu(x) + zp30(2)) de 


= pk [ (zu(z) + xd(x)) de = 0.54p5 (17.79) 


In other words, the average proton momentum carried by all of its quarks is 
only 54% of the total proton momentum! 

Where's the rest of it? Presumably in the gluons which bind the quarks 
together. But in fact the actual answer is quite complicated — virtual quark- 
antiquark pairs appear inside the proton due to gluon interactions, and these 
also carry some of its momentum. These virtual quarks are called “sea 
quarks,” to distinguish them from the “valence” up, up, and down constituent 
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quarks of the proton. The sea quarks can be any up, down or strange quark or 
antiquark that can be produced at these energies (the probability that heavier 
sea quarks are produced is very small due to their larger masses). Since the 
exchanged photon can couple to the sea quarks as well as the valence quarks 
we should modify the structure functions to include them 


where the overbar denotes an antiquark. This now erodes the predictive power 
of the quark model — now we have six unknown functions instead of two! 
However, because all the sea quarks are produced by the same mechanism it 
is reasonable to impose 


us(1) = U,(x) = d(x) = 0,(x) = 5.(1) = 5, (zx) (17.81) 


which implies 
1 
F(a) = 18 [4u, (2) + 0, (x) + 125, (a) (17.82) 
'The shape of these three structure functions can be inferred from experiment. 
However, this is a difficult job, particularly as x — 0. It is one of the chal- 
lenges of QCD to make a definite prediction of the quark structure functions 
from first principles. 


17.7 Questions 


1. Suppose a 4th generation of quarks and leptons is discovered, with 


masses on the order of 1 TeV. What do you predict for the ratio 
R= c(e*e- —hadrons) 


> 
e(ete- —nu*pn-) 


2. Show that 
(a) qu LH” (py, pi; m?) = 0 
(b) qu K"" (po, po; M?) = 0 

3. Compute Kı and Kə for a pointlike proton that obeys the Dirac equa- 
tion. 

4. Show that the Rosenbluth formula becomes the Mott formula 


2 
do u ha 2 2 0 2 
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in the limit m << E << M, using the expressions for Kı and Kə from 
question #3. 


5. Show that 
E 


= E20 
1+277 sin 3 


1 


at energies E, E! > m for the scattering of a lepton of mass m from a 
proton. 


6. Show that 
T WW WW q: p2 q: p2 
Wi, = Wi ( E ow) NE (va, g s.) c [s DP) 


7. Show that 


2 
L'"W,,-—8 |- (r? + 7) WA (47, q: pa) 


(Pe Pa) Pe Pa) 7) 


+W (q^. q: pa) ( 


8. Show that setting 


K 2 
mia (ga) =- 


yields the Rosenbluth formula. 


9. For a proton of mass M, find the fractional momentum of the scattered 
quark of mass m, where neither the proton mass nor the quark mass 
can be neglected. Expand your result in the limit where both masses 
are small compared to the energy. 
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Heavy Quarks and QCD 
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So far we've seen that we can understand baryons as bound states of 3 quarks, 
both indirectly in terms of the observable properties of baryons and directly 
in terms of deep inelastic scattering. 

Likewise, we've seen that mesons can be regarded as bound states of qq 
pairs. In fact we can consider a meson to be a sort of a quark-antiquark 
“atom,” analogous to the usual nucleus-electron atom, or the e*e^ atom 
(positronium). Perhaps we should call mesons *quarkonium", since one might 
expect that the methods used in positronium and ordinary atoms would work 
for mesons. They do...but with two big problems. 


1. We don't know the potential. 


Unlike positronium and hydrogenic atoms, in which we know that the 
force is electromagnetic (and very well understood), the force binding 
quarks into “quarkonium” is the color force and is not as well under- 
stood. The (present-day) theory of color forces is called Quantum Chro- 
modynamics (or QCD), which is similar to QED except for some non- 
linear interaction terms. It constitutes the second major part of the 
Standard Model, and is the foundational theory governing all that we 
know about quarks and their bound states. 


FIGURE 18.1 
'The one-photon and one-gluon exchange diagrams. 


The curly intermediate line in figure 18.1 represents a gluon: the force 


331 


332 


An Introduction to Particle Physics and the Standard Model 


carrier of the strong interactions. At short-distance separations for two 
quarks we expect the force to be mediated by one gluon (just like one 
photon in QED) and so the “one gluon exchange potential" should be 


Vi-giuon(T) Y : (18.1) 
However, quarks are evidently confined at separations r > 10713 cm, 
so we must include another term V....,(r) in the potential to account for 
this. No one knows how to compute this from QCD, though some hints 
have emerged from lattice gauge theory*. At the moment we must guess 
it. Some physicists assume V.,(r) ~ r others assume V.,(r) ~ In(r), 
and others V.(r) ~ Vo — any of these match present-day data quite 
well because they don't differ substantially over the narrow range of 
distances probed by current experiments, as shown in fig. 18.2. So we 
can choose 


4 Qs 
Vaco(r) = Vias (Tr) + Veone(") = Ec + For (18.2) 


Experiment indicates [166] that Fo ~ 16 tons! 


Light quarks are relativistic. 


We were able to fit baryon and meson spectra reasonably well using 
SU (3) symmetry and a potential of the above form plus spin-orbit 
coupling, working in the context of a non-relativistic model. But this 
raises a puzzle — not that the quark model doesn’t work perfectly (some 
arbitrary parameters had to be “fit” to the data), but rather that it 
works so well! It’s a puzzle because the binding energies of the light 
(u,d,s) quarks into hadrons are typically a few hundred MeV, roughly 
the same size as the constituent masses of the quarks! This is a highly 
relativistic regime, and in strong contrast to atomic physics, where the 
binding energies are 10’s of eV and the mass of the electron is 511 KeV, 
making a non-relativistic approximation reasonable. At present the 
reasons why the non-relativistic quark model works so well when light 
quarks should be relativistic is not clear. 


If heavier fourth (and fifth and sixth) quarks existed, we might be tempted 


to use SU(4) 


flavor Symmetry (or SU(5)qayor or SU(6)qayor) to explain mass 


splittings. Unfortunately this would not be a good idea because the different 
masses would break the flavor symmetry too badly. However, we might ex- 
pect that for these heavier quarks, relativistic effects will be less important 


*This is an approach proposed by Ken Wilson [164] toward understanding the strong inter- 
actions by modeling spacetime as a lattice of points and then solving the QCD equations 
on this lattice by brute force on a computer [165]. 
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FIGURE 18.2 

Form of the strong interaction potential for four distinct models, each chosen 
to give a best-fit to the spectrum of cc and bb bound states. Image from ref. 
[166]; used with permission. 


(binding energies should be considerably smaller than masses) and that these 
heavy quarks will bind together according to the potential (18.1) to form new 
quarkonium states. These will be analogous to positronium even more than 
the light mesons were. 


In fact there are heavy quarks: charm (discovered in 1974), bottom (dis- 
covered in 1977), and top (discovered in 1995). These particles (and their 
bound states) are all very unstable, and so must be produced in accelerators. 
The most straightforward way is to collide electrons and positrons, as we have 
seen before. If the collision energy is just right the cross-section will rapidly 
shoot up because a resonant bound state is produced. These bound states 
can make transitions to other lower-mass bound states, analogous to the way 
that electrons in hydrogen make transitions. Once the spectrum is empirically 
determined then a potential can be inferred from it. 
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18.1 Charm 


Charm was actually first discovered in 1971 in cosmic ray experiments carried 
out by Niu and collaborators in Japan [167]. They used an emulsion chamber, 
which consists of a sandwich of emulsion sheets that can give an accurate 
tracking of charged particles, followed by another sandwich of emulsion sheets 
alternating with lead plates about 1 mm thick, which allows for detection of 
pions, and an identification of electrons and measurement of their energy. 
Events were seen corresponding to the production of two associated particles 
that each decayed weakly. These particles were called X particles and were 
found to have mass 1.8 GeV under the assumption they were mesons, and 2.9 
GeV if baryons. These masses were far too large to be explained using strange 
quarks. They in fact were charmed hadrons, having all the characteristics we 
now understand for such particles, but were not recognized as such [168]. 

In 1974 simultaneous experiments led by Burton Richter at SLAC [169] and 
by Sam Ting at Brookhaven [170] observed a series of meson resonances. The 
Brookhaven researchers built a spectrometer to search for heavy particles with 
the same quantum numbers as the photon (J = 1,P = —1,C = —1). Such 
particles can decay into ete pairs for the same reasons a virtual photon does. 
'The idea was to search for the reaction 


p+ NS J+X Sette +X (18.3) 


where N is a light nucleus — Be was used in the experiment. This seems 
straightforward enough, but the problem is that since the (at this stage hy- 
pothetical) J is produced by nuclear reactions, the charged particles coming 
out will nearly always be pions instead of ete” pairs. So the spectrome- 
ter must search for particles of opposite charge but reject pions (and other 
hadrons). This was done using (a) threshold Cerenkov counters that could 
detect electrons or positrons, but not hadrons, and (b) a calorimeter that 
measures the longitudinal shower, which would be very different for hadrons 
than for electrons/positrons. A double-arm spectrometer was built, one for 
each charged particle, designed to accurately measure the magnitude and di- 
rection of the momentum of each charged particle. The invariant mass of the 
electron/positron pair is just m? = (pı + p2)?, or 


m (ete ) = y/2m2 + 2E Ez — 2p; p2 cos (01 + 62) (18.4) 


where the angles are relative to the original beam direction. If the J particle 
exists, it will show up as a resonance peak in m (ete”). 

At the same time the SPEAR ete” collider at SLAC was being operated at 
its maximum CMS energy of 8 GeV, collecting data using the Mark I detector. 
Some anomalies had appeared in the data at lower energies, and a decision 
was made to study these by varying the energy in small steps near 3 GeV. A 
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Observation of the J/4» resonance at 3.1 GeV, produced by e*e^ annihilation 
at SLAC. Image courtesy of Institute of Physics Publishing from ref. [171]. 


huge resonance (called the 4) — more than two orders of magnitude larger than 
its surroundings — appeared in all cross-sections, which had the approximate 
form 
2 
FP (E-M? +F 
where M is the mass of the resonance, E the CMS energy, and Tits width. 
'The peak area is 


c (E) 


(18.5) 


6rT,T; 


where T; is the partial width of the initial e* e^ state and T ș the partial width 
of the final state (which might be e*e^, u* u^ or hadrons). The smaller the 


(18.6) 
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width T, the narrower the peak and the larger its area. The SLAC experiments 
indicated that the width was extremely narrow: [ = 0.091 MeV for the 
M = 3097 MeV resonance! Furthermore, these y states were produced from 
ete” — y — y, so the w had to have the same quantum numbers as the 
photon: Jy = 1 P, = —1, C = —1, which we write as Jie —1--). You can 
see the original data in fig. 18.3. 

The extreme narrowness of the y resonances indicated that they could not 
be understood as excitations of bound states of any of u, d, s with any of ü, d, 3. 
Hence a new quark flavor called charm (c) was invented!, and the 4 was 
identified as a cc-bound state called CHARMONIUM. More y resononances 
were discovered (4” at 3686 MeV and y” at 3770 MeV), and could be identified 
with various excitations of this bound state (just the way various states of 
the Hydrogen atom can be identified with excitations of the electron bound 
to the proton). Indeed, the spectrum of states is completely analogous to 
positronium, and the 4 was identified as the 1?S, state — orthocharmonium! 
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FIGURE 18.4 
Quark-flow diagrams for the decay of the ¢. The gluons mediating these 
processes have not been drawn. 


t Actually charm was predicted to exist before the discovery of the y by Glashow, Iliopolous 
and Maini [172], who invented it to explain the absence of strangeness changing neutral 
currents in the weak interactions. We'll look at this in Chapter 21 when we discuss weak 
interactions. After this, Politzer and Appelquist suggested that if a heavy “charm” quark 
existed, it should form a nonrelativistic bound state cc, with a spectrum of energy levels 
analogous to positronium [173]. They called this bound state charmonium. 
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Other combinations of quarks can also form new bound states: cu, cd, c3 
and their antiparticles. These states are called D-mesons, and they are much 
broader than the y resonance (i.e. their lifetimes are much shorter). They 
were observed [174] at the Mark I detector in the channels 


et+e — Do -D- x ette —D*-D -X (18.7) 


where the D-mesons appear as resonances in the final state (seen by decays 
into Kaons and pions). The D°-meson has a mass 1865 MeV, and the D+- 
mesons have mass 1869.5 MeV, each of which are too heavy for 4(3097) to 
decay into: my < 2mp. Hence the v» must decay into states consisting only 
of u, d, and s quarks. 

It is this situation that gives the v its peculiar feature of a very narrow width 
and hence anomalously long lifetime ty ~ 107? sec, 1000 times longer than 
is typical of the strong interactions, but shorter than EM decays (typically 
10718 sec long). 


18.1.1 The OZI Rule 


Why does the y live so long? The answer goes back to Okubo, Zweig and 
lizuka and is called the OZI rule [175]. These scientists were puzzled (long 
before the 4 was found) by the fact that the $ meson (ss recall) decayed 
much more often into 2 Kaons (K+K) instead of 3 ms. But the 37 decay 
is energetically favored (mass(K * K^) = 990 MeV whereas mass(37) = 415 
MeV). 

OZI suggested that processes with unconnected quark lines were suppressed. 
For the decays shown in the diagram 18.4 we see that the decay into three 
pions has unconnected quark lines and so is suppressed. Experimentally 


K^*K- 
ó — —o \ 90% (18.8) 
— mm t 1096 


In terms of gluons, the decays are given in the figure below. 

Why the suppression? In the suppressed diagram on the right in fig. 18.5 
there is an intermediate state of pure gluons. These must be high energy 
(“hard”) gluons since the carry the total mass of the ó meson. However, on 
the diagram at the left, the gluon is of low energy, since most of the energy 
will be in the masses of the strange/antistrange pair of mesons. The OZI 
rule is a statement that high-energy gluon-exchange processes are suppressed 
in meson decay. In the context of QCD we now understand this in terms of 
asymptotic freedom: low-energy gluons couple much more strongly to quarks 
than high-energy gluons do, and so the low-energy gluon exchanges are more 
probable. So most often the ¢ meson will decay via the single-gluon channel, 
but occasionally it will decay with the OZI-suppressed 3-gluon channel. 
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Y 


FIGURE 18.5 
The OZI rule: if any diagram can be sliced in two by cutting only gluon lines, 
the diagram is suppressed. 


Ld y 


FIGURE 18.6 

Why w-decay is suppressed: the OZI-rule favors the diagram on the left, but 
energy conservation forbids this. Hence the diagram on the right describes 
the leading-order decay of the y. 
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Now we can understand charmonium decay. Since all mesons are color 
singlets, the connection between the initial y state and its decay products 
must be via a color-singlet gluon combination which needs at least two gluons. 
But the y couples to the photon and has spin-1, and so must have C = —1. 
Gluons also have C — —1 (they behave just like photons in this respect) and 
so three gluons carrying all of the energy of the y must be exchanged. The 
diagrams illustrating the suppression of decay are given in fig. 18.6. Energy 
conservation forbids the process on the left, and so only the much slower OZI- 
suppressed decay process can occur, giving the w a relatively long lifetime. 


The correlation between states of charmonium and states of positronium is 
remarkably good considering the aforementioned problems with quarkonium. 
You can see the comparison in the figure 18.7. Given the difference in energy 
scales — the hyperfine splitting between the n = 1 levels in positronium is 101! 
times smaller than the corresponding split between the y and 7, particles — 
the relative spacings are remarkably similar [176]. A numerical fit using the 
Coulomb+Linear potential above can be obtained by setting Fo = 16 tons 
= 900 MeV /fm. 
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FIGURE 18.7 

Comparison of the charmonium and positronium spectra[176]. Note the dif- 
fering energy scales. Reprinted with permission. Copyright 1982 by Scientific 
American, Inc. All rights reserved. 
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18.2 Bottom 


In 1977 a 2-arm spectrometer at Fermilab was used by Leon Lederman and 
collaborators [177] to study +” pairs produced in hadronic collisions 


p+ (Cu PD — ut +u +X (18.9) 


where the two arms measured the momenta of the positive and negative 
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FIGURE 18.8 

First evidence for the upsilon resonance from muon-antimuon annihilation. 
The individual states Y, Y’ are not resolved. Reprinted figure with permission 
from L. Lederman, Rev. Mod. Phys. 61, 547 (1989) figs. 12a,12b [178]. 
Copyright (1989) by the American Physical Society. 


muon. As with the Brookhaven experiment these spectrometers must have a 
high hadron rejection rate, which was attained by using a block of Beryllium 
18 radiation lengths thick to stop any hadrons from entering the arms of the 
spectrometer. A target was exposed to more than 10!6 protons, from which 
about 9000 u*u” events with energy m (pt) > 5 GeV were obtained. 
Surprisingly, three narrow resonances (similar to the v») were observed at 
about 9.5 — 10.5 GeV, all with JPC = 1-- as shown in fig. 18.8. These 
could not be identified with any bound states of u,d,s or c and so another 
flavor of quark called 6 (for bottom — though some researchers preferred the 
term “beauty”) was introduced. The lowest-energy resonance was called Y 
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and was identified as a bb-bound state. A precision study at DESY using an 
e*e- collider found that these three resonances had very narrow widths, and 
were the ground and first two radially excited states of a bb-atom. 

The mass of the b quark is so large (~ 4.5 GeV) that even more states of the 
bb-system (upsilonium) should be observed than for charmonium. Of course 
the b quarks can also bind with the u, d, s, c quarks, giving rise to B-mesons. 
The masses of these mesons are now known quite accurately [1], and are given 
in table 18.1. 


TABLE 18.1 
B-Mesons 


Quark Measured Stat (Sys) 


B-meson Width or Lifetime 
content mass (MeV) errors 

Y (13S) bb 9460.30 0.26 54.02 + 1.25 KeV 

Y (23.5, bb 10023.26 0.31 31.98 + 2.63 KeV 

Y (33; bb 10355.2 0.5 20.32 + 1.85 KeV 

Y (425, bb 10579.4 1.2 20.5 + 2.5 MeV 
B* ub 5279.1 1.7(1.4) 1.638 + 0.011 ps 
B? db 5281.3 2.2(1.4) 1.530 + 0.009 ps 
B? sb 5269.9 2.3(1.3) 1.425 + 0.041 ps 
B? cb 6276 4 0.46 + 0.07 ps 


Note that the mass difference between the average value of the two lighter 
B-mesons and the B%-meson is about 90 MeV — precise measurements yield a 
value of 89.7 €: 2.7 1.2 MeV, much lighter than the 150 MeV for the strange 
quark predicted in the old quark model. This value is in agreement with 
theoretical predictions from QCD [1]. Evidently the QCD binding energy is 
a significant contribution to the mass of the low-energy mesons! 


B-meson physics is playing an important role in mapping out much of the 
remaining uncharted territory in the Standard Model. The reason for this 
is that the b-quark can only decay into the four lighter quarks via weak in- 
teractions. These decays (if the Standard Model is correct) violate CP and 
so B-physics provides new independent tests of CP-violation beyond the K- 
meson system. Pll discuss this in more detail in Chapter 19 . 


18.3 Top 


After many years of searching, direct evidence for the existence of the top 
quark was finally obtained by the CDF group at Fermilab. The first evidence 
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Comparison of the charmonium and upsilonium spectra. 


came in April of 1994, when a preliminary measurement of the t-quark mass 
was given as 174+ 20 GeV. This initial result was of questionable statistical 
significance, and so the CDF collaboration presented their work somewhat 
tentatively. However, in early 1995 [179], this sighting of the t-quark was 
confirmed by CDF and by another group at Fermilab called DO. The newest 
results from these two groups have increased the significance of the top quark 
signal to more than 4 standard deviations. The latest measurements are 
my = 171.2+2.1 GeV [1], making the top quark the heaviest known elementary 
particle, roughly twice the mass of the Z boson. 


Observation of the top-quark is quite difficult [180]. Its lifetime is about 
10-24 seconds, and only 1 in every 10 billion collisions produces a top. Top 
quarks are produced at a Tevatron CMS energy of 1.8 TeV, primarily via 
the process pp —> tt. According to the Standard Model, the t-quark decays 
into a W-boson and a b-quark nearly 100% of the time. These events are 
described by diagrams of the form given in figure 18.10 and table 18.2 lists 
the possible decay modes of the top with the associated lowest-order branching 
ratios. Notice that I have listed the branching ratios for the leptons separately 
because they can be distinguished in the final state, unlike the quarks and 
antiquarks, which produce jets. A cartoon depiction of top decay is illustrated 
in fig. 18.11. 

Since the top quark is so unstable, there are no top hadrons. When a lighter 
quark is produced in some process it will move rapidly in a gluon field (that it 
contributes to), whose energy density rapidly becomes so intense that the field 
materializes into quark-antiquark pairs, forming hadrons. This hadronization 
process does not happen for the top quark because its lifetime is shorter than 


Heavy Quarks and QCD 343 


FIGURE 18.10 
A generic top quark decay. 


the hadronization time. So in a certain sense the top quark is a free quark, 
unbound to any others — but it only lives for a very short time! 

'The only way to really observe the top is from its decay products due to its 
weak interactions, which are almost always into a W-boson and a b quark: 


pp _— tx t—W*+b t—=W” +b (18.10) 


'The W-bosons themselves can only be observed from their decay products, 
which are into leptons (3396 of the time) or hadrons (6696 of the time); the 
diagram in figure 18.11 is for a leptonic decay. In the hadronic decay of a 
W —boson, the W decays into two quarks, which then hadronize as jets: large 
numbers of particles detected in narrow but ill-defined cones along the initial 
directions of the quarks. Gluons produced in pp - collisions also make the 
same kinds of jets. These are hard to distinguish from jets produced directly 
by the pp collision. Consequently, leptonic W-decays are much easier to single 
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TABLE 18.2 
'Top Quark Decay Modes 


Decay Mode Branching Ratio 


tE — (ab) (aqb) 36/81 
tt > (ggb) (e.b) 12/81 
tt — (qq7b) (uv,,b) 12/81 
tt — (qd'b)(rv, b) 12/81 
tt => (et vb) (u,b) 2/81 
tt => (et veb)(TD-b) 2/81 
tt > (u*v,b)(rv.b) 2/81 
tt — (e*veb)(ev.b) 1/81 
tt => (u* v b)(uv,b) 1/81 
tt — (TD b) (TDD) 1/81 


In the above q refers to any of u, d, s, c. 
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FIGURE 18.11 

Pictorial depiction of Top-antitop production at the DO Detector at Fermi- 
lab. Image courtesy of Fermilab Education Office, Topics in Modern Physics 
Program. 


out from the background: there is a large, well isolated energy deposited by 
the lepton and missing energy in the transverse direction of the beam due to 
the undetected neutrino. 

Both CDF and DO require a W decay candidate from the top or antitop. 
Events where both W bosons decay leptonically (dilepton events) are clean, 
but only comprise 5% of all tt- decays. Events where one W decays leptonically 


Heavy Quarks and QCD 345 


and the other decays into quarks (lepton plus jet events, or £j events) have a 
higher branching ratio (3096) but suffer from large backgrounds. Figure 18.3 
illustrates a typical top event. 


BOTTOM 
N ANTIBOTTOA 
N JET 


^x BOTTOM! 
ANTIBOTTOM 
JET 


3 METERS 


FIGURE 18.12 

A proton and antiproton collide producing four distinct jets (b) and some 
other particles. Reprinted with permission. Copyright 1997 by Scientific 
American, Inc. All rights reserved. 


How is the mass of the top determined? The best measurements come from 
the £j events, in which the top-antitop decay is expected to have the follow- 
ing products: (i) a single electron/positron or muon/antimuon (ii) missing 
transverse energy from a neutrino (iii) two jets from a W-decay (iv) two b- 
jet decays. CDF begins by selecting only events that have at least four jets. 
Three of them must have at least 15 GeV of transverse energy, and the 4th jet 
at least 8 GeV of transverse energy. Each event that survives this criterion is 
then fit to the hypothesis of tt — single lepton + jets channel. Out of all 
the possible jet assignments, the solution with the best fit (lowest x?) is kept; 
if this fit is too poor (i.e. the x? is too large) then the event is rejected. 

Less than 100 events survive this process. To improve the accuracy, the 
events are then subdivided as to whether or not a b-quark was “tagged” (cor- 
rectly identified). There are either 2 b-tags, one, or none, and the events are 
then regrouped into these categories, and their likelihoods recalculated within 
each group. The product of these likelihoods is maximized to determine the 
t-quark mass. 

Since these original experiments were carried out, further experiments were 
conducted with better data sets. Figure 18.14 illustrates an example of this, 


based on data from a recent run at the CDF detector at Fermilab, which 
obtained M, = 171.9 + 2.0 GeV/c?. 


top 
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FIGURE 18.13 

Analysis of a top event. In fig. 18.3 (a), a proton and antiproton collide 
producing four distinct jets (b) and some other particles. These are sketched 
in figure 18.3 (b). Multiple jets and a positron signify the possible creation 
of a top quark. The energies and locations of the particles measured by the 
calorimeter surrounding the beam line are shown in figure 18.13, with the 
energies released by the particles indicated by the heights of the bars [180]. 
Copyright 1997 by Scientific American, Inc. All rights reserved. 


18.4 QCD 


The theory underlying the strong interactions that binds quarks together is 
called Quantum Chromodynamics, or QCD. QCD is very similar to QED, 
except that the one electric charge (which can be positive or negative) is 
replaced with three color charges — red, blue and green (and their anticolors). 
Table 18.3 contains a comparative chart. 

QCD is a much more complicated theory than QED, mainly because it is 
a non-Abelian gauge theory. While the equations obeyed by the fermions 


(iy (o8, + ig, A2 Ana) = mi) Ug —0 (18.11) 
differ from the Maxwell-Dirac case only by the appearance of A", which inter- 
changes the colors, the gluons obey a generalization of Maxwell's equations 


0^ F5, — g, A" FE fà, = gp w ADE be (18.12) 


pv cb 


where 
Fiy = pAg DA 0 A ALES (18.13) 


v^cb 
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1-tag: 233 events 
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FIGURE 18.14 

Reconstructed top quark mass using a data sample consisting of 233 Lep- 
ton+Jets event candidates, collected by the CDF II detector at Fermilab. 
The darker shaded part is the background distribution, with normalization 
constrained to the calculated value. Reprinted figure with permission from T. 
Aaltonen et al. (CDF Collaboration), Phys. Rev. D79, 092005 (2009) [181]. 
Copyright (2009) by the American Physical Society. 


The general structure of such equations (with Yg = 0) was first worked 
out by Yang and Mills [182] and so we call them Yang-Mills equations. If 
you're curious as to where they come from, I’ve provided some details in the 
appendix. But don't worry about solving them — my purpose here is to show 
you what they look like so that you can see the comparison to QED. In fact, 
finding general solutions to the equations of QCD is the main motivation be- 
hind lattice gauge theory, and is one of the major efforts of strong-interaction 
physics today [165]. 

The equations in the right-hand column of table 18.3 are valid for any Lie 
group, whose structure constants I have denoted by fz,. They form the basis 
of the electroweak gauge theory, where the Lie group is SU(2) and of QCD, 
whose gauge group is the Lie group SU(3). For the particular case of SU(3) 
we have 

[A^ AP] =D (18.14) 
where the generators A* are generalizations of the Pauli matrices and are given 
by 


010 0-20 100 
At = | 100 A=| 300 à = | 0—10 
000 000 000 
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001 00 —i 000 
àt= |000 A=|000 A=|001| (18.15) 
100 ¿00 010 

00 0 ¡ {198 
W= [00-2 à= — |010 
Oi 0 v3 (00-2 


often referred to as the Gell-Mann matrices. Using them, you can easily check 
that the structure constants f" are fully antisymmetric, and that the only 
non-vanishing ones are 


1 fe — V3 


Post taht ata teat có 


along with those that are obtained from antisymmetry, e.g. f°} = -j. 

It is the non-vanishing of the f¿, that make the theory non-Abelian, giving 
rise to new interactions not present in the Maxwell-Dirac theory. Notice that 
the field strength F5, for each gluon is not linear in the gluon wavefunction, 
but depends on the other gluons that are present, even if there are no colored 
fermions. This means that the gluons act as sources for each other, which we 


can see by rewriting the equations for the gluons as 


0" (8, A5 — AAS) = 90 (ADE be + 9,0% (ASAS) ft, (18.17) 
4-9, A* (0, A5 — 8,45 — g, At ASE, ) ft, 


The left-hand side of this equation for each gluon wavefunction Af is just 
like Maxwell's equations. The first term on the right-hand side is the color- 
current due to the quarks (analogous to the electric current v^ in QED) 
— this gives rise to the expects quark-quark-gluon vertex. The A? have the 
effect of changing one quark color into another, both in the quark current 
and in the equation obeyed by the quarks. The remaining terms are the 
“color current” due to the gluons. Notice that there are derivative terms that 
are quadratic in gluon wavefunctions — these yield a triple-gluon vertex rule. 
There is also a term cubic in the gluon wavefunctions (the gp Ast At AS fb f. 
term), from which a four-gluon vertex rule arises. A complete list of Feynman 
rules for QCD is given in the appendix. 


18.4.1 Basic Physical Features of QCD 


So as you can see, QCD is a considerably richer physical theory than QED, 
and considerably more complicated. However, it has some key features that 
are worth noting. 
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1. Eight Gluons 


There are as many gauge fields as there are generators of the group 
SU(3) A unitary 3 x 3 matrix of determinant 1 has 32—1 — 8 
generators, and so there are eight gluons. These eight gluons form a 
color octet: 


|1} = Y |RB+BR) |4) = P |RG-GR) |6)— a |GB+BG) 

2) = DA IRB- BR) 17) = 5 |GB - BG) 

|3) = a RR- BB) |5) -4,|RG-GR) |8) = 2; | RR+BB - 2GG) 
(18.18) 

and we can write the gluon wavefunction as Af, where the color index a = 

|1) ,|2),...,]8). Note that each gluon is its own antiparticle. We could 


also form a color singlet (|s) — 7 |RR + BB+ GG)), which would be 
as common as the photon if it existed. Such a particle would couple to all 
baryons with the same strength, but not to all leptons, and would behave 
like a “fifth force” that would compete with gravity in its behavior on 
baryons, violating the equivalence principle. So far all experimental 
searches for such a force have yielded a null result [183]. 


. Quark-antiquark potential 


The gluon exchange diagram on the right of fig. 18.1 gives a i potential 
for the same reasons as QED. However, the non-abelian nature of the 
color charges gives an interesting answer for the coefficient of this poten- 
tial due to the interaction between a quark-antiquark pair. If this pair 
has its color charges as part of the octet configuration above (e.g the 
color charges have the wavefunction 7 IGB + BG)), then the potential 
is repulsive. But if the color charges are in the singlet configuration 
(i.e. Wi |RR +BB+ GG)) then the potential is attractive! Using the 
rules for QCD in the appendix, one finds 


" ; 2 pore 
Mid [uU (9 )ed "eu (pr Jer| LL 
4 (pi = pi) 


x [B (poe a” Abul) (p3)es] 
2 
EN Is IMD (y yay G0 (i3) (iz) 
=i d ra (p1) (p5) y, u^? (p2) 
A(p, — mi? | | F 
x DES bees] (18.19) 
which yields the 1-gluon exchange potential 
a 1 Qs 

Vi-giuon(") = -42 = z [ef rte] [dos] E (18.20) 


This potential can either be attractive or repulsive, depending on how 
the colors of the quarks are configured. A computation of the quantity 
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j yields 
41 i ] 
_ |j — 3, color singlet — attractive! 
Vi-gluon(") = Gh color octet — repulsive! (18.21) 


Because of this, quark-antiquark pair will bind into a color singlet state, 
giving support to the hypothesis that only color singlets occur in nature. 
3. Confinement 


The preceding arguments suggest that only color singlets occur in na- 
ture, but they don't prove it. They don’t explain why we can't see a 
free quark (or a free gluon). No rigorous proof of confinement exists 


FIGURE 18.15 
Some 1-loop corrections to the 1-gluon exchange diagram in figure 18.1. 


4. Asymptotic Freedom 


In QED we saw that quantum effects modified the QED potential so 
that the coupling constant varied with distance (or energy): 


Vaso(r) == alr 


where a(r) = h T 2am] (18.22) 


which in momentum space is given by eq. (14.51) 


a(q^) = i) (18.23) 


ere) 
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and o(q?) grows as q? increases (ie. the momentum transfer between 
two particles increases, which is equivalent to the distance to between 
the them becoming shorter), with m the mass of the lightest charged 
particle in the theory (in QED this is the electron). 


In QCD the analogous expression is 


As (y) 


E e] 


asa”) = (18.24) 


£)) 


where u? is some reference energy (taken to be zero in QED, but nonzero 
in QCD). The number of colors is N, which is 3 in QCD, and F is the 
number of flavors (6 that we know of for sure). Note that 


11N —2F -12 
E el 3 =71>0 (18.25) 


and we see that as q? increases, o, (q?) decreases! 


'This behavior is called asymptotic freedom — the coupling decreases with 
increasing energy or decreasing distance. This happens because the 
gluons antiscreen the quark. What happens is that along with a fermion 
loop there are now new gluon loops when one considers the exchange 
of a gluon between two quarks (see figure 18.15). This additional effect 
contributes oppositely to the fermion loops, leading to the result in eq. 
(18.34). 


The timely discovery of asymptotic freedom in 1973 by Politzer, Gross 
and Wilczek [184] led to a whole new way of looking at strong interaction 
physics. It meant that theorists could reliably use perturbation theory 
in QCD at sufficiently high energies, since it allows us to treat quarks as 
free particles. It is a basic ingredient in constructing quarkonium, and 
presumably is what is responsible for the OZI rule. 


18.5 Appendix: QCD and Yang-Mills Theory 
18.5.1 Feynman Rules for QCD 


1. NOTATION. Label the incoming (outgoing) four-momenta as p1, po, . .., pn 
(p,,Ppb,.-. Pin) , the incoming (outgoing) spins as 51, $2,..., Sn (81, 89, ..., Sha), 


"m 
the incoming (outgoing) colors with c1,c2,..., C5 Cu c, ..., C) as further 
discussed below, the incoming (outgoing) gluon polarizations as &1,65,... 
(Fes ...), the gluon colors A*, and use labels q1,q2,...,q; to denote the 


internal four-momenta. Assign arrows to the lines as in figure 18.16. 
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FIGURE 18.16 
A typical QCD diagram. 


Time flows from bottom to top, curly lines are gluons, and lines with ar- 
rows are quarks (if they point upward with time) or antiquarks (if they point 
downward against time). Note that no other kinds of fermions interact with 
gluons. 


2. EXTERNAL LINES. Each external line contributes a factor as shown in 
figure 18.17 where the color factors are 


1 0 0 
F=|0 celi c= | 0 
0 0 1 


for red, blue, and green respectively. As in QED, factors associated with 
external lines correspond to the incoming/outgoing plane-wave states, an as- 
sumption that is clearly not valid for confined quarks, but which we will take 
to be valid on the very short timescales in collisions that produce quarks. The 
gluon color factor A® is one of the eight gluon states given in eq. (18.18), e.g., 
45 |GB+BO). 


3. INTERNAL LINES. Each internal line contributes a factor shown in figure 
18.18, where m is the mass of the quark. As before q? 4 m? because the 
particle flowing through the line is virtual (i.e. it does not obey its equations 
of motion). These internal lines are called propagators. 
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External lines in QCD. 
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FIGURE 18.18 
Internal lines in QCD. 


The next two rules are the same as for QED and ABB theory. 


4. CONSERVATION OF ENERGY AND MOMENTUM For each vertex, 
write a delta function of the form 


(ETT (ky + ka + ka +-+ kw) 


where the k’s are the four-momenta coming into the vertex (i.e. each k” will 
be either a q^ or a p”). If the momentum leads outward, then k^ is minus the 
four-momentum of that line). This factor imposes conservation of energy and 
momentum at each vertex (and hence for the diagram as a whole) because 
the delta function vanishes unless the sum of the incoming momenta equals 
the sum of the outgoing momenta. 


5. INTEGRATE OVER INTERNAL MOMENTA For each internal momentum 
q, write a factor 


(27)? 
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and integrate. 


6. VERTEX FACTOR In QCD there are several different vertex factors, 
since gluons carry color. First there is the quark-gluon vertex in figure 18.19 
that is similar to QED. The dimensionless coupling gs = y4tfhcas. There 


FIGURE 18.19 
The quark-gluon QCD vertex. 


are also two vertices for gluons shown in figures 18.20 and 18.21 


k, 


C,K 
k, 
a,u kı 


8 1% 8y (ki Eo), + Bix (ka — ks), + Bui (o = ki), | 


FIGURE 18.20 
3-gluon vertex (all momenta point in). 


The remaining rules are the same as for QED. 


7. TOPOLOGY To get all contributions for a given process, draw diagrams 
by joining up all external points to all internal vertex points in all possible 
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FIGURE 18.21 
4-gluon vertex (repeated indices summed over). 


arrangments that are topologically inequivalent. The number of ways a given 
diagram can be drawn is the topological weight of the diagram. The result is 
equal to —iM. 


8. ANTISYMMETRIZATION Because fermion wavefunctions anticommute, 
we must include a minus sign between diagrams that differ 
(a) Only in the interchange of two incoming (or outgoing) fermions/anti- 
fermions of the same kind 
or 
(b) Only in the interchange of an incoming fermion with an outgoing 
antifermion of the same kind (or vice versa). 


9. LOOPS Every fermion loop gets a factor of (—1). 


10. CANCEL THE DELTA FUNCTION The result will include a factor 


EMT (p+ ph +++ + pi, — Pi — £2 — +++ — Pn) 


corresponding to overall energy-momentum conservation. Cancel this factor, 
and what remains is —iM. 


18.5.2 Yang-Mills Theory 


A Yang-Mills theory is a generalization of Maxwell's theory, in which the 
gauge symmetry group is non-Abelian. The equations can be derived using 
the same reasoning that we used in applying the gauge principle to obtain 
Maxwell’s equations. 

Suppose we have a column vector ® that transforms as 


P = U (1) > P” = U” y gps (18.26) 
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where U is some irreducible unitary matrix representation R of a group that 
in general will depend on position and time. I have put in the indices for the 
representation in the right-hand equation so that the transformation structure 
is clear. We say that forms a representation of the group, by which we mean 
it transforms according to eq. (18.26). The entries in the column vector ® 
can be thought of wavefunctions that transform into each other under this 
representation. For example, if the group is SU(3) and we let ® be a 3- 
component vector, then each entry in ® is a quark wavefunction of a given 
color. 

We want to define a derivative D,, that has the same transformation prop- 
erties as does in eq. (18.26). It must be matrix since ® is a column vector, 
which is why I have written it in boldface. In other words, we require 


Dud) -U(z)(D,9) or (D,9)"—U*,(r)(D,9)" (1827) 


where again I have put in the indices in the expression on the right. We 
also want this derivative to be a linear combination of the usual derivative 
operator plus some vector potential A,,. It is clear that the vector potential 
must also be matrix-valued — otherwise we couldn't have U act non-trivially. 
So let's write 


D, =0,I+igA, or (Dj) 4 = 0,0? + ig (Ay) y (18.28) 


where Oy is the Kronecker-delta function, which is simply another way of 
writing the identity matrix for the representation. It multiplies the partial 
derivative operator because the act of differentiation itself should not mix up 
any of the components of ®. 

Now we want to insert eq. (18.28) into eq. (18.27) and see what happens. 
By definition each side becomes 


0,9' + igA),® =U (x) (0, + igA,®) (18.29) 
or, putting in the indices 
(0,59 99 + ig (AL)" g 0) = U* (0,59. + ig (An) Pe 9*) 


(18.30) 
which simplifies to 


0,97 + ig (A5), D'E —U*,0,99 + igU%y (Au) eE (1831) 
Upon inserting eq. (18.26) this becomes 
; A 
(9,0%) 9? + U* 49,9? + ig (AL) UT 9* 
= U” 40,8" + igU*,, (Ay) e DE (18.32) 


or more simply 


(A)" y US — U* (A). — a (0,U*,) | 5% =0 (18.33) 
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Since we don't want to constrain any of the components of ®, we must have 
the first term vanish, which means 


D € 


(A1) s x U*, (An) o (U~*) B 


or in matrix notation 


y + , (8,U?4) (U~*) (18.34) 


= 1 = 
A), —UA,U 4 E (0,0) (U~") (18.35) 
Notice that if we have an Abelian group, then U = exp(i0(x)) and this 
reduces to eq. (12.14) with g — e. 
We now know how the vector-potential transforms, but we don't know what 
kind of matrix it is. This is actually not too hard to find. Since ® transforms 


under some irrep R, by definition this means that there exist generators TR 
such that 


U (x) = exp (iða (x) TR) and [TR, TR] = iP TR (18.36) 


where the f"? are the structure constants of the Lie algebra. Suppose we 
choose 0, (1) = ĝa to all be constant. Then equation (18.35) becomes 


Al, =A, + iba [TR Au] + (18.37) 


to leading order in 04. Hence we need the matrix-vector potential to also 
be in the representation R; otherwise we won't be able to make sense of the 
commutator in (18.37). The most general thing we can write down is a sum 
over all of the generators, A,, = A TR; and so we have 


D,-—ó,I-igA*T& or (D,)', =0,0%g + igAS (TR) s — (18.38) 


for a given representation. In other words, the choice of matrix A,, depends 
on which representation ® transforms under. For example, if the symme- 
try group is SU(3) and we take ® to be the 3 of SU(3) then (Dis) ie = 
OÍ y + igA;j, (US where the A* are the Gell-Mann matrices (18.15); if 
the symmetry group is SU(2) and we take ® to be the doublet of SU(2) 
then Dis = 010% y + ig Al R where the c* are the Pauli matrices 
(5.31). 

What is the field strength of the vector potential? It's clear that it can't be 
the same as in the Abelian case (i.e., it can't be simply 0, A, — 0,A,,), since 
the U-derivatives in (18.35) have a non-trivial matrix structure. We could 
try to guess at the form of the field strength and then make sure that it had 
the correct transformation properties, but we aren't even sure of what those 
are yet. An easier way is to use the approach of question 6 in Chapter 12 — 
we will compute the commutator of the covariant derivatives acting on ® and 
define what comes out to be the field strength. In other words, we define 


F,,9 = ig[D,,D,] 9 (18.39) 
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for all possible representations R. It's not too hard to show that this gives 
F,, = 0, A, — OA, + ig [ Ay, Av] (18.40) 


or alternatively 
F3, = 0,A5 — 0, A5 — gfe ALAS (18.41) 
where I have used the property of the structure constants that f** got gL 
—fbac — fabe — I 
You can also use (18.39) to show that 


E = UF UT! (18.42) 


under a gauge transformation. Unlike the Abelian case, where the field 
strength was invariant, here we see that the field strength is covariant — 
in other words the field strength itself transforms under a representation 
of the gauge group. In group-theory language, matrices that tranform as 
M' = UMU-! are in the adjoint representation, where the generators are 
the structure constants of the algebra (re)? = if*®) as discussed in ques- 
tion 10 of Chapter 3. So the field strength transforms under the adjoint 
representation. 

Finally, we need differential equations for the vector potentials Af. To en- 
sure that the Abelian case reduces to Maxwell's theory, they must be second- 
order differential equations. They should also be covariant under a gauge 
transformation, which means that we must use covariant derivatives in the 
equation for F,,,. The only equation that satisifies both of these criteria is 


(D! Fp) = JE (18.43) 


where J is called the Yang-Mills current, a generalization of the electric 
current to Yang-Mills theory. Expanding out the left-hand side gives 


DES, — g, A Fb fa, = Je (18.44) 
and it is possible to show from this that 
(DEI) =0 (18.45) 


or in other words that the Yang-Mills current is covariantly conserved. 


18.6 Questions 


1. Draw and label one lowest-order diagram for each of the processes 


(a) T^ +p— A + KO (b) rt +p — X* 4+ K* 
(c) mt +n — r? +p (d) p +p — K7 + Kt 
(ec) ptp—A°+K* +p 
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10. 


. Find the value of R = 
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Use quark-lines in your diagrams to describe all hadronic particles. 


e(e*te- — hadrons) 
c(ete- —pu*pn-) 


at CM energies of 2.5 GeV 
and 5 GeV. 


. For the QCD A-matrices show that Tr (àA!) = 99°" 


. Show that 2 mesons in the color singlet state FA |RR +BB+ GG) ex- 
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perience a potential V = —37. 


. What is the color singlet combination |S) of two gluons? 


. (a) Draw and label the lowest order-diagrams for gluon-gluon scattering. 


(b) Draw and label the lowest order-diagrams for quark-gluon scatter- 
ing. 


. Verify the formula 


2 
ARAR = 205074 — zu) 


for the A-matrices. 


. (a) Given 


F,,9 = ig[D,, D,] 9 


show that 
F,, = 0, A, — OL A, + ig [ Ay, Av] 


or alternatively 
Fiy = ô Ap — OA}, — gig At AS 
(b) Show that for a given representation R 


((D4D, — D,D,) 9) = igF 5, TR® 


. (a) Show that 


A 
F,, = UFU 
under a gauge transformation. 


(b) Show that if 
(D Fu) = JS 


V 


then 
(DEI eg 


Show that any Hermitian 3x3 matrix can be written as a linear combi- 
nation of the identity matrix and the eight A“ matrices. 
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The first observations of weak interactions were made by accident in 1896, 
when Henri Bequerel was investigating phosphorescence [185], the phenomenon 
in which certain materials glow in the dark after exposure to light. He thought 
that phosphorescence might be connected with the glow produced in cathode 
ray tubes by X-rays, and so would place various phosphorescent minerals on 
a photographic plate to see what would happen. There was no effect until he 
tried to expose a compound of uranium salts to the sun, and photographed the 
emission spectrum. He stored the compound and the photographic plates in- 
side a desk drawer. When he returned to develop the plates he found that they 
were overexposed. The plate was deeply blackened, and it soon became clear 
that the uranium compound must have emitted some new kind of penetrating 
radiation quite different from phosphorescence, since the plate blackened even 
when the mineral was in the dark. This was the first observation of natural 
radioactivity, a phenomenon governed by the weak interactions [186]. 

Further exploration showed that radioactivity was a complicated phenom- 
enon. A magnetic field was found to split radioactive emissions into three 
types of beams: alpha, beta and gamma. The alpha rays were seen to carry 
a positive charge, beta rays a negative charge, and gamma rays were neutral. 
Furthermore the alpha particles were much more massive than beta particles, 
as determined from their deflection angle. The alpha particles were eventually 
found to be helium nuclei, the gamma particles photons, and the beta particles 
electrons. 

The beta particles presented a puzzle lasting 30 years. It was clear that 
they were electrons that were emitted not with discrete energies, but rather 
as a continuum, quite unlike the emissions due to other nuclear transitions. 
Why was that? Furthermore, it was clear from experiments in the 1920s that 
there were no electrons inside nuclei. So where do the beta rays come from? 

Pauli suggested that there was a very light, uncharged, and penetrating 
particle that he called the neutron, two years before Chadwick's discovery 
[137]. While it is common today to introduce and discover new particles, this 
was a radical step in 1930, when the electron and proton were the only known 
elementary particles, and Pauli wrote in a letter in December 1930 that “From 
now on, every solution to the issue must be discussed. Thus, dear radioactive 
people, look and judge." Most of them judged negatively, but Fermi was one 
of the few people to take Pauli's idea seriously, and renamed the particle the 
neutrino, or “little neutral one.” Fermi proposed that a neutrino is emitted 
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along with an electron in every beta decay. Both the electron and the neutrino 
share the decay energy, with the electron sometimes having a little energy and 
sometimes a lot, leading to the continuous spectrum. 

Controlled experiments on the very slow decays of certain neutron-rich (or 
proton-rich) nuclei into other, more stable nuclei were able to isolate beta 
emission. For example in the process 


Os 9 Bate +7, (19.1) 


there is the emission of a beta particle (the electron), along with the anti- 
neutrino, which we now know is not the same as the neutrino. The initial 
state (Cs!?9 here) is called the parent nucleus and the final state (in this case 
Ba99) is called the daughter nucleus. Another example is 


C — BY +e Ep, (19.2) 


The atomic weight of the daughter nucleus in both cases is almost exactly the 
same as the parent. 

Radioactive decay will change one nucleus to another whenever the binding 
energy of the product nucleus is larger than that of the initial decaying nucleus. 
The difference in binding energy determines which decays are energetically 
possible and which are not. The excess binding energy appears as kinetic and 
rest mass energy of the decay products. The Chart of the Nuclides (shown 
in fig. 19.1) plots the proton number, Z, of the nuclei against their neutron 
number, N. This chart plots all known nuclei — stable and radioactive — 
naturally occurring and artificially made — along with their decay properties. 
You can see that nuclei with an excess of protons or neutrons as compared to 
stable nuclei will decay toward the stable nuclei either by changing protons 
into neutrons (or neutrons into protons) or by emitting neutrons or protons 
(either singly or in combination)*. 

After the discovery of the neutron, it became clear that all radioactive 
processes could be described by the reactions 


n—p+e + or p—n+et+v (19.3) 


which are referred to as P-decay processes. In terms of quark constituents, 
we understand these reactions to be 


d—ute +7 or u—d+et+re (19.4) 


but we don’t need to look at this level of substructure to understand the basic 
physics of these decays. 


*Nuclei are also unstable if they are excited, that is, not in their lowest energy states. In 
this case the nucleus can decay by getting rid of its excess energy without changing Z or 
N by emitting a gamma ray. 
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FIGURE 19.1 

Chart of the Nuclides, in which horizontal rows represent the same element 
(constant Z) with a variable number of neutrons (N), known as isotopes. The 
black squares represent stable nuclei, the open squares unstable nuclei, and the 
smooth curves are theoretical nuclide stability limits. Image from Radiogenic 
Isotope Geology by A. P. Dickin, Cambridge University Press (1997) [187]; 
used with permission. 


19.1 Fermi's Theory of Beta-Decay 


How can we understand them? Fermi proposed the first theory of P-decay in 
1934 [188]. The decay probability of a nucleus of mass My is given by the 
same decay formula that we've been using all along 


2 3 3 yf 
dr Sc TI c (d Di.) 


— 2hMw |- t 2E; (2x) 


3 
| |(v pops M| p). (27)^ 5 (Es -») (19.5) 


i=1 


except that we now have three particles in the final state: the stable final 
nucleus, the electron and the antineutrino. So we have 


(d?p.) (dp) 
El ES 


© 


dv = - 
16 (27) AMN Ey 


IM|? ô (Eo — E — EL) (19.6) 


where I have integrated out d3p'/y6 (py + p. + pz), which just ensures that 
momentum is conserved between the electron, the anti-neutrino and the fi- 
nal state nucleus N’. The quantity Ey = En — E^ is the energy difference 
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between the parent nucleus and the daughter nucleus. The kinetic energy 


of the daughter nucleus is 4/ Ig, + M2, — My ~ 107? My and so we ne- 
glect it — we regard Eo as being shared entirely between the electron and the 
antineutrino, and Ey ^ MN. The phase space measures are 


dp, = Anpzdp; = Amp E;dE;, dp, = 4rp dpe — (19.7) 


and so 


2 5 dy! 12 
p= p ie Pe e - P—maiMP (19.8) 
(21)? hMy My, Ee 
once we integrate over the ó-function. I have included a non-zero mass for 
the anti-neutrino in the above expression, since we will soon see there is some 
interesting physics here. 

The matrix element M is determined by what we postulate our theory of 
8-decay to be. Fermi's original idea was to take the square of the matrix 

element to be proportional to the energies of the emitted particles, i.e. 


|MP = ELEL F(Z, El) | Mo|" (19.9) 


where an additional factor F(Z, E) — called the Coulomb factor — has been 
included. It takes account of the energy lost (by electrons) or gained (by 
positrons) as they escape from the Coulomb field of the nucleus. Non-relativis- 
tically it is [189] 


2 Ze 
at where 7 — d s 


F(Z, Ee) 


(19.10) 


> exp [- 27] “Arcohv 


for B particles of speed v emitted from a nucleus of charge Ze. Relativistically 
it is complicated to calculate, but can be done exactly. 
Hence we find 


5 2 
C Mol aL (Ey — El) F(Z, E) (En — El? — midp, (19.11) 


dr = 
(3273) AMy My. * 


Now the decay rate is the probability that an electron with energy E = E. 
will be emitted within the momentum range p = pl and p = p, + dp. The 
probability is proportional to the number of electrons emitted within this 
momentum range. Dropping the primes and the subscript “e” we have 


m2 


N(p)dp x |Mol? p? (Eo — EY F(Z, E), |1 - ——2~d 19.12 
(p)dp x |Mo|" p” (Eo — E) F(Z, E) (E Ey (19.12) 


So by plotting Vi vs. the emitted electron energy E, we can test 


this simple picture of P-decay. Such plots are called Kurie plots, and are 
found to be linear for pretty much all radioactive nuclei, indicating that the 
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remaining matrix element factor |Mo|^ is approximately constant and that 
Fermi's idea was on the right track. In fact, experiment indicates two possible 
values for this quantity: 


Gi. if AJ —0 (Fermi transition) 
3G} if AJ =1 (Gamow-Teller transition) 
(19.13) 
where M, is the mass of the proton and Gp is a dimensional constant now 
known as Fermi's constant. It can be found by integrating the Kurie plot 
from p = 0 to p = pmax, since this gives the total decay probability. The 
inverse of this probability is the half-life of the unstable nucleus. A standard 
in nuclear physics is the decay 


|Mol? = (Mp)? x { 


Qs N! + aw, (19.14) 


which yields 
Gp = 1.166 x 107 (GeV)? 


from its Kurie plot [190]. Today the value of the Fermi constant is set by 
accurate measurements of the lifetime of the muon [191], 


Gr = 1.16637(1) x 107? (GeV) ? (19.15) 


a process we will consider in Chapter 20. 

Note that the details of a Kurie plot depend on whether or not the (anti)neu- 
trino has mass. The plots in figure 19.2 show a comparison between the 
straight-line m, — 0 case (dotted line), and what a Kurie plot looks like if 
the neutrino has a non-zero mass. Unfortunately this has not proven to be 
a very good way to determine if neutrinos have mass. P-decay spectrometers 
have finite resolution that is poor near the end point energy E = Ep. At 
present the best limits are given in table 19.1. The upper bound for the mass 


TABLE 19.1 
Neutrino Mass Limits 


Neutrino Upper bound on Mass (in eV) 


Ve 2 
Vy 1.9 x 105 
V. 1.82 x 107 


of v, comes from observations of tritium decay. These experiments actually 
estimate the quantity m and find negative values, albeit consistent with 
zero within limits of error [192]. This illustrates how difficult it is to extract 
the mass from spectrometer experiments via Kurie plots. These experiments 
report m,, « 2.3 eV, a slightly higher upper bound than the Particle Data 
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FIGURE 19.2 
A typical Kurie plot, showing (in exaggerated form) the difference between 
zero and nonzero neutrino masses. 


Group average in table 19.1. The best bounds on the m,,, mass is also lower, 
my, < 1.7 x 10° eV [193] but the best upper bound for the m,, is from the 
ALEPH group [194] and is in table 19.1. 


It is possible to get bounds on neutrino mass from cosmological observa- 
tions. Neutrinos with masses of only a few tenths of an electron volt (eV) can 
significantly influence the formation of large-scale structures (such as galaxy 
clusters) in our universe. By observing galaxy distributions in the context of 
a the best-fit cosmological model, an upper bound of Y m,, < 0.62 on the 
sum of all neutrino masses has been placed [195] using data from the 2 Degree 
Field Galaxy Redshift Survey (2dFGRS), which is the largest existing redshift 
survey [196]. 


We'll return to the topic of neutrino masses in more detail in chapter 25, 
once we've had a chance to study the structure of the weak interactions in 
more depth. 
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————— | 
19.2 Neutrino Properties 
Now let’s use the Fermi theory to compute a cross-section for the process 

Te +p — n+et (19.16) 


We don’t yet have any Feynman rules for this process. Fortunately we don’t 
need them because in the formula for the differential cross section in the lab 
frame (proton at rest) is 


d ny” S|M|? I 
m MEET T um 
dQ 87) Malpil[lp/| (Er + Mac?) — |pi| E, cos 6] 


and so all we need is the matrix element, which from Fermi’s theory is |M|? = 
EE' | Mol? = EE' (mpe) G3., setting the Coulomb factor to unity. Taking 
the antineutrino to be massless, so that |p1| — E, and neglecting the mass of 
the electron so that |p;'| = E”, we get 


do (HN? — M, (Ey G3. 
dà \8r) [(E + Myc?) — Ecos 6] 


(19.18) 


where M» = Mp. For incident antineutrinos in the MeV range, M,c? > E, E' 


and so 3 ; 
da AN Mc CE") G3 E 
~ 1+0 19.19 
do (5) Myc? E Mpe? ( ) 


which implies that 


licG p E' A? 
o~ An (=) c 107% cm? (19.20) 
T 


This number will change by a factor of order unity if the target is a nucleus, so 
we can regard g = 10-7 em? as a typical cross-section for neutrino scattering 
off of any substance. 

This cross-section is almost unimaginably tiny. If you think of a proton 
as a hard sphere, then you might expect it to present an area of mre to an 
incoming particle, where rj ~ 107% cm is the approximate radius of the 
proton. This gives zr? ~ 107% cm”, about 20 orders of magnitude larger! 
The tiny cross-section in eq. (19.20) implies, for example, that the mean free 
path of the antineutrino in water is 


-1 
Ly = (length/nucleus) = (# of nuclei/vol x o) ! ~ (e x 108)° x 0) 
= 10% cm for water (19.21) 
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or 100 light years! Put another way, the probability that a single antineutrino 
will interact with a target of water about a meter thick is 10718. 


This small cross section is the reason why neutrinos are so harmless. To put 
this into perspective, a human body typically contains about 20 milligrams 
of K*°, which is f-radioactive. As a consequence, we emit about 340 million 
neutrinos per day — but we don't even notice it because they almost never in- 
teract with any of our cells! The flux of neutrinos from the sun, the earth, and 
other cosmological sources likewise interacts so infrequently with our bodies 
and all other biomatter that we can proceed through everyday life as though 
they don't exist. Too bad the same can't be said for ultraviolet light! 


However, the small cross section in eq. (19.20) also makes neutrinos diffi- 
cult to detect, because they almost never interact with anything. Remarkably, 
Reines and Cowan [197] observed such interactions in 1959! Dubbing their 
efforts “Project Poltergeist” (because the neutrino is so elusive), they used a 
1000 MW nuclear reactor as a source, which has a flux of about 101? antineu- 
trinos /cm?/sec, as illustrated in fig. 19.3. The target was CdCl and water. 
An antineutrino, upon interacting with a proton, will produce a neutron and 
a positron. The positron rapidly comes to rest via ionization loss and forms 
positronium, which emits y-rays. The water thermalizes the neutrons before 
the cadmium captures them, a process taking several microseconds. Once 
captured by cadmium, the neutrons emit another y-ray. Hence the charac- 
teristic signal for an antineutrino is two ^y-rays emitted microseconds apart. 


Photomultiplier 


Delayed coincident 
detection of y from "og 
with pair of ys from 


Port from Pa: 7 s^ &annihilation. 
nuclear A 
reactor RANI 
Neutrino 
flux 
13 
10 "/em2s : 
Water target with 
scintillator plus 
CdCl 2: 
FIGURE 19.3 


Schematic Diagram of the Reines-Cowan experiment. Image courtesy of the 
Hyperphysics Web site, copyright C.R. Nave, Georgia State University (2005). 
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Nowadays, a modern neutrino experiment like NOMAD [198] detects about 
one neutrino every 10 seconds, depositing on average 27 GeV in the detector. 
For the entire duration of the experiment, neutrinos will have deposited a 
little more than 0.03 Joules — about 1/10 the energy in a sneeze! 

Neutrinos have other unusual properties that were also discovered in the 
1950s and 60s. In 1956 Lee and Yang suggested (before neutrinos had been 
observed in the Reines/Cowan experiment) that the weak interactions did not 
conserve parity [199]. They offered this as an explanation for why the decays 


Kt — ntn? and Kt — «sag? (19.22) 


were both observed - the pion parity is negative and so the final states have 
opposite parities. If parity were conserved then one or the other of these states 
should not be seen. 

In 1957 Wu tested this idea [39] by looking at the decay of Co9? nuclei at 
very low temperature (0.01 °K). Co9? decays via the reaction 


Co Ni” + e7 +p (19.23) 


and one can check parity violation by measuring the expectation value of the 
operator de - p, where J= JJ is the spin of the nucleus and f is the direction 


of the electron momentum. The quantity is a pseudoscalar (P ( : D) = 


= p) and so if parity is conserved, the electron emission rate (which must 
be proportional to d p) should be the same in either direction regardless of the 
spin of the nucleus. This seems straightforward enough to measure, except 
that at room temperature the spins of the nuclei will all be randomly oriented. 
Consequently the experiment must be performed at ultracold temperatures in 
order to minimize this effect. A magnetic field B will split the energy levels of 
the nucleus via the Zeeman effect, so that the energy of a state with magnetic 


quantum number m is E(m) = Eo — gunm E . Ata given temperature 


T the number of nuclei with magnetic quantum number m’ as compared to 
magnetic quantum number m will be 


GLN El 
kgT 


A ( EA n (19.24) 


f 
=e — 

N (m) kat ) dem 
making the most positive magnetic quantum number (e.g., m = +1 fora J = 1 
state) the most likely to be populated. At room temperature the population 
differences are negligible, but at low temperature kpT << gun |B |, only the 
most positive m state will be populated, fully polarizing the nucleus in the 
direction of the magnetic field, as shown in fig. 19.4. The mirror world is 
obtained by reversing the direction of the magnetic field. 

The spin-parity of the cobalt nucleus is JP? = 5+, and the actual decay 
observed is 


|Co™; j =5,m = 5) — [Ni*;j = 4,m = 4) +07 +0 C925] 
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FIGURE 19.4 
B-decay of Co® and its mirror image. 


where the excited nickel nucleus decays to its ground state via the emission 
of two gamma rays 


Ni**99: j = 4, m = 4) — [Ni*%; j = 2,m = 2) + y (1.173 MeV) 
— [Ni%; j = 0, m = 0) + y (1.332 MeV) (19.26) 


where in each electromagnetic decay there is some relative orbital angular 
momentum between the gamma ray and the nucleus. This means that for 
each decay the emission probability is not isotropic but will depend on the 
angle of the y emitted relative to the magnetic field. However, it won't 
depend on a reversal of the magnetic field since electromagnetic interactions 
conserve parity. Hence the degree of polarization of the cobalt nucleus can be 
monitored by measuring the y anisotropy. 

The experimental setup is shown in fig. 19.5. Wu used two Nal crystals 
placed at 90? relative to the long direction of the container and the other was 
placed close to the top (at nearly 0°) to count the emitted y's . The emitted 
electrons are counted by a small anthracene crystal placed just 2 cm above the 
cobalt sample. The magnetic field is switched on to polarize the nuclei, and 
then switched off after a few seconds so that counting of electrons and photons 
can take place. As expected, the same y anisotropy was observed regardless 
of the field direction, indicating the sample was polarized for a few minutes. 
During this time electrons were observed to be preferentially emitted when 
the magnetic field pointed up (i.e. from the north pole of the nucleus) — in 
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FIGURE 19.5 

Setup diagram of the apparatus used measure beta decay in Co9?. The spec- 
imen, a cerium magnesium nitrate crystal containing a thin surface layer of 
radioactive cobalt-60, was supported in a cerium magnesium nitrate housing 
within an evacuated glass vessel (lower half of photograph). An anthracene 
crystal about 2 cm above the cobalt-60 source served as a scintillation counter 
for beta-ray detection. A plastic rod made of lucite (upper half of photograph) 
transmitted flashes from the counter to a photomultiplier at the top (not 
shown). Reprinted figure with permission from C.S. Wu, E. Ambler, R.W. 
Hayward, D.D. Hoppes and R.P. Hudson, Phys. Rev. 105, 1413 (1957). [39]. 
Copyright (1957) by the American Physical Society. 


a mirror world they would be observed to be preferentially emitted from the 
south pole (i.e. when the field pointed down). This must mean that parity 
is violated — the only way that parity could be conserved is if the electrons 
were emitted in equal numbers at both poles. The data from Wu’s experiment 
appears in fig. 19.6. 
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FIGURE 19.6 

Wu’s data showing the preferential counting rate of electrons emitted from 
Cof? nuclei as the sample heats up from 0.01 °K. Reprinted figure with per- 
mission from C.S. Wu, E. Ambler, R.W. Hayward, D.D. Hoppes and R.P. 
Hudson, Phys. Rev. 105, 1413 (1957). [39]. Copyright (1957) by the Ameri- 
can Physical Society. 


So weak interactions violate parity — but by how much? The parity- 
conserving electromagnetic force suggested that the neutrino is the particle 
that must have parity-violating properties. A simple way to quantify this 
is as follows. Consider a spinning particle moving in some direction. If its 
spin is aligned with the direction of motion, we say that it is right-handed; 
if it is anti-aligned, we say that it is left-handed. You can determine this 
by pointing your right thumb along the direction of motion. If your fingers 
curl in the same sense as the particle is rotating, the particle is right-handed; 
if in the opposite sense then it is left-handed (see fig. 19.7). Note that if a 
particle is massive, this is not an invariant concept — by going to a reference 
frame that moves in the same direction of the particle but faster, the sense 
of motion will reverse direction, but not the sense of spin. Hence chirality 
(or helicity or handedness) is an observer-dependent concept for a massive 
particle. However, if a particle is massless, you can never go to a frame of 
reference that moves faster than the particle (since it is moving at the speed 
of light) — and so chirality is an invariant concept for massless particles. 

If parity is violated in weak interactions, then neutrinos of a given helicity 
will be preferentially emitted more than the other helicity. Goldhaber, Grodz- 
nis and Sunyar [200] figured out a clever way to measure neutrino helicity by 
transferring it to a photon via the reaction 


eb ok 
e jJ = y Me 


[Eu ?: j =0,m= 0) + 
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FIGURE 19.7 
Pictorial representation of helicity. 


— [nj = 1, mgm) + 


v; j = jn) (19.27) 


where the Eu'*? absorbs the electron when it is in an S-wave, i.e. no orbital 
angular momentum, a process called K-capture. The excited Sm*!°? nucleus 
decays via 


¡ar = 1, MS") — [Sm*82; j =0,m= 0) +|yj=1,m,) (19.28) 


Angular momentum conservation requires the z-components of the spins to 
balance 
Me = Moy* + m, = My + My (19.29) 


admitting only the following relationships 


+3 -i 1 (19.30) 
d oi 
—-5 +571 
since m, = +1 because the photon is massless. This means that the z- 


component of the neutrino spin — its helicity — is anticorrelated with the 
z-component of the photon spin. 

What one needs to do is measure the z-component of the spin of the emitted 
photons. This was done by placing the Eu!?? sample inside an iron slab in 
a magnetic field B , which orients the electrons in the iron in an opposing 
direction. If the photon spin is aligned with B then the electrons can absorb 
the photons, whereas if it is antialigned then they cannot do so. In this way 
the iron slab acts like a filter that admits only one value of m,. A ring made 
of Sm203 surrounds the detector so that the emitted photons must encounter 
it. The emitted photon — if it has the right energy — will excite the Sm!?? in 
the ring, which upon decay back to its ground state will emit a photon that 
can be detected (at least sometimes) by a Nal detector. However, the photon 
hitting the ring will have the right energy only if the Sm*!°? produced from 
the Eu? is moving in the same direction as the photon, in turn ensuring the 
neutrino is moving in the opposite direction. Otherwise, the recoil energy of 
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the Sm*!? will change the energy of the emitted photon, forbidding excitation 
of the Sm in the ring. In this manner the orientation of the magnetic field 
controls the emission direction of the neutrino. By measuring the asymmetry 
in the number (or intensity) of photons at the Nal detector [i emitted for 


opposite directions of B, one measures T4 = I (my = +1) 
I-I 
R————— 19.31 
I} +I ( ) 


and so in turn measures the handedness of the neutrinos emitted. Since 
the experiment ensures that whenever a photon is detected the neutrino is 
moving opposite to the direction of B , a measurement of R tells us how much 
the neutrino spin is aligned with the direction of motion (right-handed) or 
antialigned (left-handed). 

The experiment showed that all neutrinos were left-handed — there were no 
right-handed neutrinos. In other words the mirror-image of a neutrino does 
not exist! 

So the most natural interpretation of Goldhaber’s experiment is that the 
neutrino is fully left-handed and therefore massless. However, recent obser- 
vations of neutrino oscillations at Super-Kamiokande [41] and the Sudbury 
Neutrino Observatory [42] indicate that the different species of neutrinos can 
transform into each other, a process we will look at in Chapter 25. The sim- 
plest explanation of this phenomenon is that neutrinos have mass, and hence 
both the right-handed and left-handed kinds exist. So — more precisely - if 
there are right-handed neutrinos, they do not interact with any known form 
of matter!, other than their left-handed counterparts into which they can 
oscillate. 


19.3 Kaon Oscillation 


The realization that P is not a symmetry of nature was quite disturbing to 
many physicists. Why should nature be lopsided, choosing between left and 
right? Pauli, for example, was very reluctant to believe it — “I cannot believe 
God is a weak left hander” he said — but after Wu’s experiments, he said “God 
is indeed a weak left hander — the laughter is rightly with the others" [201]. 
But acting with IP on a left-handed neutrino yields the (evidently) non-existent 
right-handed neutrino. It appears that God is completely left-handed! 
However, once the properties of neutrinos were determined (by about 1960), 
it appeared that CP could be a symmetry. Acting with P on a left-handed 


TOf course here I mean non-gravitational interactions — since all neutrinos have energy they 
will all respond to the gravitational influences of other bodies and vice versa. 
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neutrino yields the (evidently) non-existent right-handed neutrino. But act- 
ing with CP yields the right-handed antineutrino, which does exist! Similarly, 
acting with CP on the right-handed antineutrino yields the left-handed neu- 
trino. 

So perhaps God is more subtly ambidextrous, since all other known particles 
also respect this symmetry. For a few years people thought this was the case, 
and that perhaps CP is a symmetry of nature. But in 1964 James Cronin, 
Val Fitch and collaborators observed violation of CP in Kaon decay. This is 
a rather peculiar situation, so we’ll pause to look at it carefully. 

It had been noted in 1955 by Gell-Mann and Pais [202] that a K? (strangeness 
+1) could change into its own antiparticle, the K? (strangeness +1). These 
mesons had been determined to be pseudoscalars by previous experiments. 
Hence 


P[K?) — —|K?) and P|K°) = — |K?) (19.32) 
C |) B | &9) and C | &9) = |K?) (19.33) 
since C interchanges particles and antiparticles. Hence 
CP|x%) =- |K°) and CP|K®) =-|K") (19.34) 
so we can construct eigenstates of CP: 
1 a 1 2 
Kı) = o (|&K?) —|K°)) and |K) = m (|K°) + |K°)) (19.35) 


This provides us with a system in which we can test CP: if CP is conserved 
then |K,) can decay only into eigenstates with CP = + 1, and | K5) only into 


eigenstates with CP =—1. Since Kaons typically decay into pions we therefore 
predict 
Kı — 27,4r,67,... and Kə — 37,57, TT,... (19.37) 
since 
CP |x°n°) = (CP |r°)) (CP |395) = (-1? = +1 (19.38) 


CP rt) =C |rt) P |rt) 2(-0'(-1! 241 — (19.39) 
We can only produce beams of K%s (or K°’s) from other scattering exper- 
iments, which means that the beam is a mixture of Kı and Kə states 
1 
v2 


Since it is easier (more probable) to decay into fewer particles due to the 
available phase space, we expect that the Kı part of the beam will decay 


|K9) = — (1K1) + 2) (19.40) 
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FIGURE 19.8 


Feynman diagram for Kaon oscillation, analogous to coupled pendula as shown 
in the figure underneath. 


away quickly into pairs of pions, and that after a few centimeters the beam 
should be pure Kə. Indeed, experiment indicates 


TK, = (0.8926 + .0012) x 10 ?s Tg, = (5.17 + .04) x 1078s — (19.41) 


Hence a beam of Kaons should decay only into 37's after a few centimeters. 

In 1964 Christensen, Fitch, Cronin and Turlay carried out this experiment 
[203]. By steering a proton beam extracted from the Brookhaven synchrotron 
onto a target, a beam of Kaons and other neutral and charged particles can 
be produced. The charged ones can be deflected away by magnets, leaving an 
undeflected neutral beam. After a few meters only the long-lived K will be in 
the beam, which is unavoidably contamined with neutrons and gamma rays. 
The K$'s enter a volume that ideally should be vacuum, but was actually a 
bag of helium gas that minimized interactions between beam particles that 
could simulate the decay. A two-arm spectrometer, adjusted to anticipate 
possible decays into two pions, was placed on the other side of the bag. Spark 
chambers placed before and after bending magnets allowed measurement of 
the momentum and charge of the decay products. By requiring that the angle 
between the sum of the momenta of the decay products and the beam axis is 
zero and that the mass m (rr) of the two-particle system be compatible with 
the Kaon mass, spurious three-body decays are suppressed. 

Out of 22, 700 Kaon decays, 45 were into 2 pions! The simplest interpreta- 
tion of this experiment is that there is a small amount of CP-violation in the 
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Kaon system. The long-lived Kaon is evidently not pure Ke, but is instead 
mostly Kə with a small mixture of Ky: 


1 
1/1 + lel? 


where e is a measure of nature's departure from perfect CP symmetry. Ex- 
perimentally [1] 


|K) = (e 1) + |K2)) (19.42) 


|. A(Ky — ttr") 

~ A(Ks — ttr?) 

= (2.285 + .019) x 107? exp [144°] = |p, .|e'?" (19.43) 
A (Kr — rro) 

A(Kg — n929) 


= (2.275 + .019) x 1073 exp [i44?] = [noo] e?" (19.44) 


ete’ = ny- 


€ 2e' = Noo = 


where |Ks) = Ju (X1) —e|K2)) is the short-lived orthogonal Kaon 


state. We see that, empirically, the magnitudes of roy and 4. are almost 
the same, and we can set le] = |ņ+-| = .0023 — a small but non-zero effect! 
This value is about 45/22, 700 = .00198, the original Fitch-Cronin result. 
Interestingly, CP violation cannot be explained in the old (pre-1976) Stan- 
dard Model with only four quarks; you need at least six quarks in order to be 
able to explain this effect (you'll find out why in Chapter 21). Other models 
were proposed well before this to explain CP violation, most notably Wolfen- 
stein’s postulated “superweak” interaction: a new force in nature whose 
strength is 10~'° that of the weak interactions [204]. It predicted that e’ = 0, 
and for quite a long time was consistent with experiment. However, the most 


recent experiments [1] have confirmed that Re (2) = (1.65 Æ 0.26) x 1073, 


ruling out the superweak model. 

CP violation is important and interesting to study for several reasons. Al- 
most all extensions of the Standard Model imply that additional sources of 
CP violation exist. Futhermore, a necessary condition for baryogenesis — the 
dynamical generation of the matter-antimatter asymmetry observed in our 
universe — is that there be a sufficient amount of CP violation [8]. However, 
the Standard Model with six quarks fails to account for this asymmetry by 
several orders of magnitude [205], suggesting that there are additional sources 
of CP violation. 

How might such additional sources be found? In general, CP violation may 
be tested by measuring 


NIS 

T+T 
where A is the asymmetry for a given process: the difference between a given 
decay rate and its CP-conjugate divided by the sum of these rates. For Kaons 


(19.45) 
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FIGURE 19.9 
Diagrams for CP violation in the B-meson sector of the Standard Model. 


A is naturally small — A ~ 107? for various Kaon decays. This makes the hunt 
for additional sources of CPP violation difficult because no matter what Kaon 
decay you look at, .A is always going to be this small, making it extremely 
difficult to experimentally check the origin of CP violation as explained in a 
given theoretical framework (Standard Model or otherwise). 


Fortunately the situation is not quite so grim. The standard 6-quark model 
also predicts that CP violation will also occur in the D? D? and B9B? sys- 
tems, each by the same mechanism as the Kaon system. The D? experimental 
physics is messy, but the B-meson physics is “clean”: it is virtually a complete 
analog of the Kaon system, except for its heavier mass and shorter lifetime. 
But not as short as it might have been — the B°-meson fortuitously has a 
relatively long lifetime of abour 1071? sec and a large mixing with its antipar- 
ticle B’-meson [1]. This means that, in principle, A could be of order unity 
for B? decays. This would be consistent with the Standard Model in which 
CP violation arises due to the difference between various quark masses (b, s, d, 
etc.). In contrast to this, the superweak theory (and most other competitors) 
explain CP violation as coming from a new force independent of quark mass; 
hence in these models A should be the same for B°-meson as for Kaons. So 
observation of large values of .A would rule out (or severely constrain) such 
models. 
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Vertex 
Detector 


FIGURE 19.10 
'The BaBar detector, an important detector used to determine the decay prop- 
erties of B-mesons. Image courtesy of SLAC National Accelerator Laboratory. 


The BaBar experiment was the first to observe CP violation in the B-meson 
system [206]. Studying the Y(4s) resonance, from which tens of millions of 
BB pairs are produced, a search is carried out for a CP-violating decay of 
a B-meson (reconstructed from the observed decay products that typically 
contain charm) that is correlated with the recoil of a neutral B-meson (the 
tag). Early hopes were that physics beyond the Standard Model would soon 
be found. 

Unfortunately things weren't quite that easy. Direct observation of CP 
violation in the decay B? —^ K *«- indicates that [1] 


4 eS) -T (B° > Kr) 0.095 + 0.013 (19.46 
Kt+r- = T (B° > K-r+) +T (B° — K*n-) zm . Æ : . ) 


which is rather small effect. However, CP violation has also been observed in 
B — nt +r”, B? — nK*9, and B+ — p? K* decays. Searches for additional 
CP asymmetries in decays of B, D and K mesons continues. So far every 
result obtained has been consistent with the Standard Model. 

It sounds like a “good news" confirmation of the Standard Model. However, 
incorporating these measurements into the Standard Model yields a prediction 
that there is one proton for every 10!? photons due to the cosmic imbalance 
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of particles and anti-particles. This is in disagreement with cosmological ob- 
servations (one proton to 10? photons) by many orders of magnitude [205]. 
Clearly we have a lot to learn about CP violation! 


19.4 Questions 


1. 


How many meters of iron must a v,, of 1 GeV penetrate so that it will 
scatter, on average, once? How long does this take? 


In the decay [Co*; j =5,m= 5) — [Ni**69: j =4,m= 4) te +. 
the intensity of the emitted electrons has the form 


I(8,0) — 1-- a8 cos 


where 0 is the speed of the emitted electron relative to the speed of light 
and 0 is the angle between its direction and the direction of the Cof? 
spin. Neglecting orbital angular momenta in the final state, deduce the 
value of a. 


A K?’ and a K? beam, each of equal intensity, pass through a slab of 
matter. Are the beams attenuated equally? Why or why not? 


A Kə beam passes through a slab of matter. What will the emerg- 
ing beam consist of and why? How can you experimentally test your 
answer? 


You make contact with alien physicists through a wormhole into an- 
other part of the multiverse, and soon develop a common language of 
communication with them. They have developed methods of traveling 
through the wormhole in short times and are considering visiting Earth. 
However, before they visit, you want to be sure that they are not made 
of antimatter. Because of this lack of knowledge, it's too dangerous to 
send objects through the wormhole, but you can ask them any questions 
you want about experiments they have performed, and you are able to 
communicate with them results of experiments performed here. 


(a) Can you determine if any of C, P, or T are conserved in all interac- 
tions in their universe? 


(b) Can you determine if CPP is conserved by C and IP are both violated 
in their universe? 


(c) Can you determine if CPP, C and P are each violated in their universe? 


How many kinds of neutral B-meson oscillation are there? Draw the 
lowest-order diagrams for each. 
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7. The GALLEX experiment measures the flux of v, from the sun by count- 
ing electrons in the reaction 


Ve + (a es e` + Ge”! 


The energy threshold for this reaction is 233 KeV. The expected flux 
of neutrinos from the standard solar model is ® = 6 x 10!4/m?/s. Sup- 
pose for simplicity that the entire flux is above threshold. If detection 
efficiency is 3596, how many Ga"! nuclei are needed to have one inter- 
action with a neutrino per day? What mass of Ga’! does this entail? 
How much mass of actual gallium is needed if the Ga”! isotope is 40% 
abundant? 
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Today Fermi’s theory of beta decay has been superseded by the modern-day 
theory of electroweak interactions. If we remained consistent with the nomen- 
clature for the electromagnetic and strong interactions, we would call this 
theory Quantum Geusidynamics [207], or QGD (“geusi” meaning “flavor” 
in Greek — another possibility is quantum aesthenodynamics (QAD), “aes- 
theno” meaning “weak”). This nomenclature did not catch on, and today 
people simply refer to the theory as Electroweak theory. 


V, V, 
p p 
4 7 
a or g 
r t 
FIGURE 20.1 


General form of leptonic weak interaction vertices. 


All known quarks and leptons undergo weak interactions. The mediators of 
weak interactions are three spin-1 particles: the W+, W7, Z9, the superscripts 
referring to the electric charges of the mediators. They are analogous to the 
photon in QED and to the gluons in QCD. However,, they are strikingly 
different in that they are extremely massive 


Mw = 80.398 + 0.0259 GeV Mz = 91.1876 + 0.0021 GeV (20.1) 


and they are unstable to decay: 


Tw = 2.141 + 0.041 GeV Tz = 2.4952 + 0.0023 GeV (20.2) 


The above masses are world averages of all experimental results [1]. 
“Charged” weak interactions (mediated by the W’s) are simpler than “neu- 
tral" ones (mediated by the Z ) so we'll consider them first. To start with, 
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we'll look at weak interactions of leptons. As with QED and QCD, we can 
describe weak interactions using vertices and propagators. The fundamental 
leptonic weak interaction can be described by the vertex shown in fig. 20.1 
where & =e ,u ,orT and ve = Ve, Vu, Or Vr. Notice that — unlike either 
QED or QCD - weak interactions change one type of particle into another! In 
the case above charged leptons are changed into their neutrino partners. Note 
also that there are no e” /v, or T /ve vertices — leptonic weak interactions 
do not “cross over" between generations! 

The diagrammatic rules are similar to QED, except for two things — the 

1 


vertex factor has a 4 (1— y”), and the propagator has terms that depend 


upon the mass and width of the W, as shown in fig. 20.2. The width only 


V, 
d 
Pa 
__ Àdgy 4 (] — 5 
Ly) 
i 


FIGURE 20.2 
The leptonic vertex factor and internal-line propagator for W bosons. 


becomes important if we are colliding particles at energies at or near the mass 
of the W and so we will neglect it. The constant gw is the “weak charge,” 


with aw = E the weak coupling constant, analogous to a in QED and Qs 
in QCD. 

The factor of 3(1 = 49) is of crucial importance, and must be inserted 
(at least in the lepton sector) because of Goldhaber's observation that all 
neutrinos are left-handed. Recall (from the appendix in Chapter 11) that 


(Py) = N^. (By) (20.3) 
(yy) = det (A) AY, (Voy 55) (20.4) 
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ie. the (U"v)) transforms as a vector, but (U»"4?v)) transforms as a pseu- 
dovector (or axial vector, as it is often called) as indicated by the det (A) 
factor. Since the vertex has both a vector part and an axial vector part — we 
say that the W-boson couples to both a vector current and an axial-vector 
current — it must violate parity. That the parts are equal in magnitude but 
opposite in sign is a consequence of the purely left-handed character of the 


neutrino. For a fermion v = A we have 


;0-39-5 (3 p >y =5 (1-71) ¥=5 Ean (20.5) 


and we see that i (1 — y?) projects out one of the two spinor components (in 
this basis, the y+ x part is eliminated). So a left-handed spinor has the form 
[ = (A 2 . À right-handed spinor is obtained by projection with > (1 + 7) 


1 


jene (E) eee CAE) 


and so has the form y = (S, 


'The W-propagator differs from the usual -it that appears in QED and 
QCD because of the mass of the W's. In almost all reactions (except those at 
LEP and higher-energy machines) q? «& My, so we find 


: Gv — dudv/ Miy - uv 
1 ~ 20.6 
eM, | "PM +ihMwlw| ML 209) 


as a good approximation to the W-propagator*. 
Finally, we still have the condition that the polarization vector €” of a W 
is orthogonal to its 4-momentum: 


e-p=0 (20.7) 


which reduces the number of independent degrees of freedom of a W from 4 
to 3. However, unlike the photon (and gluon), this condition fully exhausts 
the gauge freedom in the model of weak interactions — we do not invoke the 
Coulomb gauge. 


*I should note here that the Breit- Wigner form of the propagator (20.6) is an approximation 
for energies near the mass of the weak bosons, and that the [w term is neglected everywhere 
else (Ty « My in general), yielding the low-energy propagator that follows from the 
equations in electroweak theory. 
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20.1 Neutrino-Electron Scattering 


Consider the simple scattering process e" + v, — p + Ve as shown in fig. 
20.3 


FIGURE 20.3 
Electron-neutrino scattering diagram to lowest order. 


'The matrix element is 
M = (=i)? (5 m" vato (q — 25 
= (—i) 2) [a(p,)y (1 — 7”) u(p1)] 


Suv — dud» / Mj, PANI v 
A [e| [u(p2)7” (1 — y?) u(p2)] 


= (ae) [u(p)y" (1 — 7°) u(p1)] [u(p5) (1 — 7°) u(p2)](20.8) 


The Casimir trick gives, using the trace theorems in Chapter 13 


Y [alp y (1 — 9°) uo] [alo (1 — 59) u(p1)] 


= D [FTE (s +) 0 (eme) 

= Tr | (19) (y, - m) (1-97) (.)] (neglect my.) 
= Te [s^ (10) (y, +m) 0 +7) Y (A) 

-T[*0-) 


2 
* 
= 
| 
E 
al 
bw 
TAS 
3S 
BS 
imn d 
2 
S 
P m 
^w. 
as 
keen d 
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= zy (1- 9°) (y) Y (P) 
= 2m [y (p,) 1 (i) 2m p Qs)" (11) 
= 8 [pipi + pipt — 9"" (pı pi) + ie" "pia p] (20.9) 
where 4^ (1— 45) = 4^ (1— y?) (as is easy to show) and the neutrino has 
been taken to be massless. Hence we get 
wu 64 


4 
g l 
M S ( n ) [p pi + pipt — g"" (pi p) + i£" Pp p; a) 


2/2Mw 
x [pb,¿P2v + P24Pby — Juv (Pa ` Ph) + Enpvo php | 


1 4 
ES (2) [2 (p, - p2) (px : P2) + 2 (p1 ` pa) (P1 - pa) 
HE pr gEnpvo PSPS | (20.10) 


where terms of the form HOV By Di Po Poy vanish, because by momentum 
conservation p3, = —p}, + pay +Piv, in which case the ¢-tensor contracts over 
two identical objects (similarly e^^"? PiaP au = 0). We also have the result 


MOM er ove = —2 (5069 — 6258) (20.11) 


which (as shown in eq. (20.33) ) follows from the definition of the e-tensor. 
Hence 


4 
imp = (4) CEERI o tm op TCI 


2 w 
— 27 Pia pip (0502 — 0207) php? | 


4 
z > Gi) [2 (pi : P2) (px + P2) + 2 (pı : p3) (ri > P2) + 2 (p3 : p2) (Pr ` P2) 
—2 (pi + p3) (93 - pa) 
4 
23 t3 (P - ph) (i =p») ve 


a remarkably simple expression! 
Now let’s look at this in the CM frame, for which 


pi =(E,p) pb-(üp-PB »=(9",-5%)  »f-(.P) 


- E 1 
E e|" - El B o 5 (E+) -5 


M 
(E + |p]) 
RS >/12 
(pi - p2) = E [p] + [p^ (pi: p2) = E |g "| + |p '| 


where M is the mass of the muon. Hence 


mem (20) (retos) (20+) 
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=2 ES Pula (e+ bay? 


- (22) [e+ wy? - ne] e+ ia 
= ES (e VE =e) al (EVE? - m2 + E? — m?) 
~a (B) fa a Eee (20.14) 


where the electron mass m has been neglected in the last line. The formula 
for the differential scattering cross-section is 


do Ei SIMP |5" 
( 


da 87 Ey + Ey) pl 
AD (my e h a (E + |p)? — M? 
8m Mw 4E? 2 (E + |p) |p] 
1/ dug. X os M24)? 


in the limit of large momentum, where in the last line the correct speed-of-light 
factors have been introduced. 

We see here that the cross-section increases with energy, something quite 
different from what we observed in QED and QCD. This is a consequence of 
our approximation of the W-propagator as we shall see. 


20.2 Muon Decay 


Neutrino-electron scattering is really hard to experimentally implement. How- 
ever, muon-decay is an easy experiment to do, and the diagram is given in 
fig. 20.4. 

The matrix element is 


M = (a (2) nt» (55) uo] [2 ete | 


x [u(pz)y" (1 — y?) v(»2)] (20.16) 
This is just like what we had for neutrino electron scattering, except that 


the electron-antineutrino has a v-spinor final state. This is nice because the 
Casimir-tricks don't change at all provided the neutrinos are assumed to be 
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P 


FIGURE 20.4 
Muon decay diagram 


massless. Hence none of the manipulations change, and we get 


MF =2 (4) (pi - pa) (pı - pa) (20.17) 


in other words, the matrix element is proportional to the dot products of the 
lepton momenta with their counterpart neutrino partners. Note the relabeling 
of momenta in fig. 20.4 relative to the diagram 20.3 for electron-neutrino 
scattering. 

Since we are concerned with muon decay, this time we want to look at this 
expression in the rest-frame of the muon. We have 


pt = (pi +p2 +p)" pit =(M,0) pi = (E), p2) 
2 2 
> (p+p1)=m* +2 (p2: p1) = (pi — ph)” = M?—2(p1 : ph) 
1 1 
> (p2: Ph) = 5 (M? = m?) — (pi : p2) = 5 (M? — m?) - ME) (20.18) 


and so 


4 
-— 1 
IM] =2 ES ME) G (M? — m3) — ME’) 


4 
~ (2) ME (M —2E)) (20.19) 
Mw 


where the electron mass m has again been neglected. The decay rate is 
dn A yg 
2E! (2x)? 2E, (2n)? 2E (21)? 


c? 


dU = 
2hM 
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4 (4) (,/ / 

x (27) 6) (py + pa + po — pi) (20.20) 

where now Es = |p2| = E is the energy of the outgoing electron, neglecting 


the electron mass. The 3-Body phase-space integral is done in the appendix, 
and the result from eq. (20.56) is 


4 [2 r2 
Iw M*E*dE ( 4E ) 
dl = 1 20.21 

(4) 2h (An)? 3M? ( ) 


Momentum conservation forces E < iM c? (see why in the appendix), so the 
total decay rate of the muon is 


0 (s) AE a) 
“Jo Mwc) 25 (47)? 3M2 


4 5,2 
= ( Iw ) M : (20.22) 
Mw) 12h (8x) 


for a muon lifetime of 


(20.23) 


1 My \* 12h (8x)? 
ym = 

FU M9w Mc? 
which goes like the inverse fifth power of the muon mass M. 


Note that in both of these problems, the coupling constant gw always is 
divided by the W-mass. If we define 


V2 Iw 3 
= 20.24 
Gross (543) eae) 
then 
1 1925? 
Ty = a (20.25) 


T, GL(Mey 


We can experimentally measure the muon lifetime — it is expt = 2.197 x 107° 
sec. The best measurements |1] then imply 


Gp = 1.16637(1) x 107? (GeV) ? (20.26) 


which (as we saw in eq. (19.15)) is Fermi's constant! This was an early result 
from the Fermi theory of B-decay. Today the muon lifetime provides the 
standard by which the value of Fermi’s constant is empirically determined!. 
Of course in Fermi's original theory [188], there was no W-boson. Weak 
interaction vertices were given by direct 4-fermion couplings of the type shown 
in figure 20.5 but from the modern perspective we understand this diagram 
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FIGURE 20.5 
A vertex in Fermi's original theory. 
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FIGURE 20.6 
Weak interaction theory reduces to Fermi’s theory in the limit of large Mw. 


to be an approximation: That is, Fermi’s theory is reproduced whenever the 
W-mass is much larger than any other energy in the problem! Indeed, the 
idea of a W-mediator was first suggested by Oscar Klein in 1938 [208]. 

Today it is possible to independently measure the mass of the W, and so 
we can deduce the strength of the weak coupling constant gw. We find 


gw = Mw V AV2Gp = 80 x ava x 1.166 x 10-5 = 0.653 


(gw)? (0.653)? 1 
PAWS gr 1257 ^ 0942 gg 2027) 


L e J 
İFermi’s constant is actually defined to be Gp = x (xz) (fic)? = Gr (hoy. 


Throughout this book I will work with the constant Gy, defined so that it has units of 
inverse energy, and (hopefully without confusion) refer to this as Fermi’s constant. 
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which is a surprise — the weak coupling strength is five times larger than the 
electromagnetic strength a = 437 !! 

Weak interactions are feeble — but not because the coupling is aw small. 
Instead, they are feeble because the gauge bosons that mediate the interac- 
tions are very massive compared to all subatomic particles (except for the top 
quark). In LEP experiments — which operated at energies comparable or a 
bit bigger than the mass of the Z-boson — weak interactions were observed to 
be stronger than electromagnetic ones! 


20.8 Appendix: Mathematical Tools for Weak Interac- 
tions 


20.3.1 A Note on the e—Tensor 


The epsilon tensor is a fully antisymmetric tensor (it flips sign under inter- 
change of any pair of indices) that has as many indices as there are dimensions. 
In two spacetime dimensions it is 


Euv = 0 if H=; E€01 = l= —€10 (20.28) 

Note that €?! = —1 because of the minus sign that appears in the metric 
(recall £^" = g"?g"72£ ,,). The product of two e-tensors in two dimensions is 
Pe) = — (6505 — 0561) (20.29) 


which can be shown by brute force: we must get zero if either y = v or a = f. 
To get a nonzero answer clearly a must equal either one of y or v, and 8 must 
equal the other of these. This means that every term on the right-hand 
side must be a product of delta functions between the upper indices and the 
lower indices, since the magnitude of any index set is either 0 or 1. Setting 
u=0 =q and v = 1 = then gives the correct signs. 


The same reasoning can be applied to any dimension. In our (3 + 1)- 
dimensional world the e-tensor must have 4 indices and be fully antisymmetric 
on all of them, and so is given by 


Euvas = 0 if any two indices are equal; £9123 = 1 
Euvaß = Eluvas] = Eval = Evan = —€vopyu = ete (20.30) 


where the square brackets mean “fully antisymmetrize on all indices”, and 
69123 — —], The product of two e-tensors is always either 0 or +1 and so as 
before we will get a product of Kronecker-delta-functions: 


EPY E vag == (1570253 + all possible signed permutations) 
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= — ôf usai] 
= — 67.57.5253 + 050,007 — 56635763 + ete (20.31) 
where there are 24 terms in all. Contracting over the (p, u) index pair yields 
"Te pap = — (467020) — 555253 + 656554 — etc) 
m T SO SY 
= 0,020 y) (20.32) 
Contracting on another pair of indices (e.g., (7, 1)) gives 


PTE prop = tii 


= — (46753 — 625} + 8362 — 625} + 8362 — 48302) 


= -2 (6753 — 6583) (20.33) 
and again 
£716 prog = —2 (8263 — 6953) = —2 (48) 53) = 683 (20.34) 
and finally 
EP gy = —24 = —4! (20.35) 
Note that . 
: 2 V XQ 
VP = iy yy? = Fewap Y Yt? (20.36) 


because the y-matrices all anticommute. 


20.4 Appendix: 3-Body Phase Space Decay 


Consider the decay rate of a body of mass M into three other bodies, as in 
fig. 20.4: 


1 
-2M 


/ dp, d? p) d?pa ¡MP 
2E! (2x)? 2E, (2n)? 2E (21)? 


x (27) 5 (p, + pa + pb — pı) (20.37) 


where we'll assume that m? — 0 (i.e. one of the final state particles is massless) 
so that 


E, =y |P tm Ej=|pJ] E= yP +m? (20.38) 
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Integrating over d*p) gives 


2 d? pd? p» 
E, ELE 


where now E! = 4/ [E + pal? + m2. 


Next let's do the p5-integral. We write the coordinate axes so that py - pa = 
|p | [pa] cos 0 = |p2| E cos 0, which fixes the polar axis along the direction of 
P2. This gives 


| IMP 8 (Ej + Eo + Eh — M) (20.39) 


dp, = [g^ d|p, | sin6d6dà = E? EJ sin 6d6d$ (20.40) 


The $ integration is easy (it gives 27), but the 0 integration requires more 
care because Ej depends on 0, and so the ó-function depends non-trivially on 
0. Explicitly 


E, = J| E? + [po]? +2 |; | E5cos0 + m? = X (0) 
E; dX 
> — sinddd = — — 
E [pa] 
and we can rewrite the 0 integration as an X integration, giving 


a a2 | | Bp» as 


(20.41) 


| | M]? 6 (Ef + E; + E) — M) 


(4x) M E» EAE, 
2(2 d?padE? is 
- en E twr f dX 5(X+E,+E,—M)| (20.42) 
(An) M [p| Ez x- 
where the range of X (0) is bounded by the cosine function: 
X4 = X (cos9 = +1) = y (E; + [io] + m? (20.43) 


This yields 


1 if X_ < M- E- E, < X} 


X+ 
f aXS(X + E+E- M) = |) otherwise 


(20.44) 
Noting that X? > (M — E5 — Ej) > X? we obtain a range for Ef which is 


1 (M? — m? + m2 ME 
E BB. gio L2) : 


(20.45) 


Hence we integrate 


X4 Ey 
n wr f dX 6(X + E2 + E} — M) = ip dE¿ IMP? (20.46) 
xX 
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where |M]? is a function of E? as determined from the diagrams. 
In a 3-Body Decay with one massless particle, we have 


4 
v2 
Wa =2 (LE) (oh pe) (ib) where p = (p) +m er" (2047) 


and conservation of momentum implies 


2 
(p2 + pi) = m$ + mj + 2 (pa - p^) 


= (pı p). = M? - 2 (pı - ph) (20.48) 
1 
(pa 91) => (M? — m2 — m?) - ME, (20.49) 
simplifying the matrix element to 
ME =2 Iw EU. 1 (M? m2 mi) ME! (20.50) 
Mw 2 2 2 1 2 k 
which gives 
E+ 
D dE, | MP 
- 7 
= (4) , ABE [(M? — md — mi) -2MEj] 
guy L 
WwW 
=M ES E (M? — m$ — m2) (E; — El) - ZM (Ef - El) 
4 
=M (4) J(E») (20.51) 
W 


since from (20.45) E+ are both functions of Ez. 
Finally we do the po-integral. There is no further angular dependence, so 
writing 


dpo 
[pa | Ez 


BP py = |po|? d |p3| dQ = |po| ESdEodQ > = dEodQ (20.52) 


and using f dQ = 47, we get 


2 (27) | d?p3 


X+ 
n we f dX 6(X + Ej 4- E, — M) 
X 


(4n)? M |p2| E» 
_ 2 (2n) (47) LB 
ar J dE,M ( A) J(Es) (20.53) 
We can writ 
: db _ E ( 9w ) (20.54) 
dE h(4m)? \Mwe? i 
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as the decay rate per unit energy of particle 2, which is usually taken to be 
the electron. I have put in the factors of A and c, which you can deduce by 
noting that J(E) has units of (Energy)*/c?. 


For the muon decay problem mı = 0, since the 1’ particle is also a neutrino. 
Neglecting the electron mass ma gives 


E nM x og -lxucapESSiw (20.55) 
tUM-EZE ^^ 3 Meno) l 
yielding 
J(E) = L (M?) lmz- (lu.g ' luli (lu E i 
EE 4 2 3 \8 2 
(e 4E 
= ¿ME (1-37) (20.56) 


which is what we obtained for muon decay. 


20.5 Questions 


1. (a) Compute the relative rate for the 7 lepton to decay into a muon as 
compared to an electron. 


(b) Estimate the lifetime of the 7 lepton, neglecting the mass of the 
muon and the electron. How well does this agree with experiment? 


2. (a) Compute the ratio 


o (vy +e — Vu +E) 


v (V,-e Du +e 


assuming that m, « E « My. 
(b) Experimentally R = 1.38*0 24. Use this information to determine 


the value of sin? ðw. How does it compare to the best present-day 
value? 


3. For the scattering process u^ +1, — T. +v,, compute the cross-section 
in the CM frame to lowest-order in the couplings. Do not neglect the 
masses of the neutrinos. 
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4. Consider muonium, a bound state of u* with e^. 


(a) What are the possible spins of the two lowest-energy states? Which 
has the lowest energy? 


b) There are two possible Ways that muonium can decay. What are 


(c) Draw a diagram for each decay mode in part (b). 


(d) Only one decay mode is possible for the lowest energy states. Which 
one is it, and why? 


(e) Compute the ratio of the decay rates for the state in which both 
decay modes are allowed. Which is the more likely decay? 


5. Compute the following for the 3-index Levi-Civita tensor e“: 


(a) £e ef (b) gone ik (c) EE ey (d) eek 


Al 


Charged Weak Interactions of Quarks and 
Leptons 


DOI: 10.1201/9781420083002-21 


'The structure of the weak interactions for quarks is somewhat different than 
for leptons. Before getting to the quark interactions themselves, let's first see 
what we can learn by applying the same methods to the decay of hadrons. 
After studying a few key weak hadronic decay processes, we'll then go on to 
explore the charged weak interactions of the quarks, comparing them to those 
of the leptons. 


21.1 Neutron Decay 


Since we experimentally observe 
n—— pte +0 (21.1) 


we expect that charged weak interactions (mediated by the W’s) amongst 
neutrons and protons are described by the vertex in figure 21.1. 

If the neutron and proton behave like leptons, then we expect the axial 
current coupling c4 to equal the vector current coupling cy and for both to 
equal unity. Of course we only expect this to be valid at low energies, where 
the substructure of the neutron and proton are irrelevant and we can treat 
them both as point particles. The fundamental theory will have a vertex 
containing only quarks as we shall see. For now let's use the vertex in figure 
21.1, treating it as an effective theory, and see how far we get. 

So neutron decay at low energies should (to lowest order) be represented by 
the diagram which is just like the diagram 20.4 for muon decay, with u^ — n 
and Y, — p. Hence we obtain 


M = (~i)? ey (2) [a(p,)y (1 — ey”) u()] ur | 
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P 
Wu 
^ 
P d 
Ww u 5 
=-—fy"(c,-c 
U (c, -c47°) 
n 


FIGURE 21.1 
General form of the neutron-proton-W vertex. 


FIGURE 21.2 
Lowest order neutron decay diagram. 


x [U(p2) yu (1 — 5?) v(v5)] (21.2) 


where e = a The Casimir trick now gives 


Y^ E (1 — ey?) u(p))]! Far)” (1 — e) u(pi)] 


spins 
=4[(1 +) (ppt + pipt — 9"" (pr 91) 
Hiet "Pp pig + (1— €) mampg*”] (21.3) 
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and so we get 


— dps * 
my = 2 GA) [(1 + €?) (ppt + pipt — 9"" (pr - ph) 
-F2ieg "P. api g + (1 — e) mampg"”] 


X [Pou P2v + P2uPo, — Guv (pa ` Ph) + i€upve?3 p3] 
c g 5 
= X |) [2(1- €) ((pi - p3) (p1 pe) + (P1 + P3) (05: P2)) 
4 (Mw 

2i? ce" Pp Di gEnpvo p? DZ — 2 (1 — €) mam (pa: p5)] 

4 

= cy Iw j 2 1 1 2,1 u 
E (1 + €)" (p1 Pz) (p1 + p2) + (1 — €) (p1 + p2) (Pi + p2) 


— (1 — €) mom, (ps : po)] (21.4) 


where as before terms vanish whenever the e-tensor contracts over two iden- 
tical objects and we have used eq. (20.33) 


pov Pe ug = 2 DA = 6207) 


to obtain the last line. We see that if e — 1 (i.e. if the neutron and proton 
have the same vector/axial-vector coupling to the W boson as the leptons do) 
then the matrix element is the same as for the muon decay problem. 

In the rest-frame of the neutron we have p! = (m4,0). Assuming the 
antineutrino is massless, from momentum conservation pl = (p! + pa + pa)” 
we obtain the following relations 


1 
(p2: p1) = 5 (m2 — m2 — m2) — mn E; 
1 
(pa pi) = 5 (mi + me — mp) - Mn ED (21.5) 
(p2: pa) = ; (mz — m; — m2) + mn Es + mn Ez 


which you can show by simplifying relations such as (pa + p1)? = (py — pb)? = 
m2 — 2 (pı : pa). Hence we get 
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which we now insert into the 3-Body Decay rate we computed in the previous 
chapter: 


1 d?p» NE — 
T(n— pte +z.) = Z d E^|.M 21.7 
( p ) (Ary? ] | MA E» =a " | ( ) 


where now 


Hog mia rie) ip (21.8) 
Mn — E F [pa 

with E; = E the energy of the outgoing electron. Note that we can no 
longer neglect the electron mass since mn ~ Mp. Carrying out the integration 
(as is done in detail in the appendix of Chapter 20) gives 


E I(E) / eT si) 
dE (4n)*> \ Mwe 
where 
J(E) = ; (m? — m2 — m2) e? (E? — E?) — sm. (E? — ES) (21.10) 


putting in the correct factors of c. 

To get the total decay rate of the neutron we need to integrate J(E) over 
E. Rather than do this integral in full detail, we can make use of the fact 
that 2r contains a lot of small numbers. Let's define 


ç= in» — 0.001293 6= ™ = 0.000511 
Mn 


n 


ERE b= Vi E - (21.11) 


mac? 


and then approximate 2r to leading order in these small parameters. The 


result is 


a (c) +30 s 


(4x? 


'The last integral is straightforward to do, and we obtain 


2.4302 e 2 z 4 
D(n— pte +%) = (ct ; ci) TE ES (21.13) 
4 2 
x E z NA 14 ln (ee -1) 


where 12.93 
Ç Mn — Mp ; 
E Me 5.11 ot vee) 
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and this gives 


T (n — p+ e7 4 v.) = 6.533 


(c? + 3c) mec? (men 


4 (4x) i V Mw 
— 8.0732 x i94 364) (21.15) 
for a neutron lifetime of 
Tn = 1239 x EM sec (21.16) 


The actual lifetime of the neutron is 72 P^ = 885.7 + 0.8 seconds [1]. It 
is clear that if neutrons and protons couple the same way to the W-boson 
as leptons do (i.e. if cy = c4 = 1) then we get only very crude agreement 
between experiment and theory. In fact, we can independently measure cy and 
ca (from, for example, the decay of 140), and the results are 


cy = 1.000 + 0.003 cA = 1.26 + 0.02 


and the most precise measurements give [209] 
ca/cy = 1.27200.0018 (21.17) 
which means we obtain 
Tn = 1239 x .694 = 859.8 sec (21.18) 


which is better, but still out by about 3%. More accurate calculations have 
today yielded an agreement between theory an experiment to within about 
1%. 

Why can’t we do much better than 1%? And why do cy and ca differ? 
The answer presumably lies with the strong interactions amongst the quarks 
that make up the proton. It is the task of sophisticated QCD-modified quark- 
model calculations — perhaps via lattice gauge theory [210] — to predict these 
values. 


21.2 Pion Decay 


Pions are bound states of up and down quarks, known in the charged case to 
decay into a lepton and its corresponding antineutrino. The decay of a 7^ is 
a weak-interaction analog of positronium decay. We don't know the bound 
state wavefunction for a u and d to form a 7^, so let's write the diagram as 
in figure 21.3. The matrix element is 
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p^ P 


V, 5 


p 


FIGURE 21.3 
Lowest-order pion decay diagram. 


Mc (I2) mew (1 = 98) o04)] Flo)  — eu» 


where the quantity F„(p) is called the pion form factor. 

We are in a similar situation to the one we faced with deep inelastic scat- 
tering: we don't know what the vertex is between the pion and the photon. 
So let's proceed as we did in Chapter 17. By Lorentz covariance F, must be 
a 4-vector. Since it can only depend upon the pion 4-momentum, we must 
have 


Falp) = f (P) p, = f (m2) p, = frPy (21.20) 


since p? = m2. The quantity f; is called the pion decay constant, and must 
be determined by experiment. Summing over the final spin states gives 


E! 
Wat = 8 (08) Y koopa- 59) vob] atop (17) vo] 


E (ie) a [PAY (1 — 1°) (P, + me) 


spins 
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- E GS Y wa ht 


2 4 
= 5 Ea) [2 (P: p3) (P: p1) — P? (we - p4)] CES) 


Assuming a massless antineutrino, momentum conservation implies 


1 " 1 1 
(P Pi) = 5 (2 +74)” — md) = z (1? mé) = 5 (m2 — mi) 
1 
(p: p2) = p? + (BaP) = (Ba Ph) = 5 (mz — mi) (21.22) 
(p: p1) = P? + (ph pi) = m? + 2 (ph pi) = mé 


giving the expression 
—— 3 2 a 
[Ml = mi (v = wi) (20) 


'This is the matrix-element for a 2-Body decay, which is really easy to compute: 


2,3 4 52 242 
T= S |p| ™ = Im; E ) Tv, (1 me) (21:23) 


— B8ulim2c — 256rh (Mw) m2 m2 


From this we learn two things 


1. If me > mz, then IT = 0 because energy and momentum cannot be 
conserved. This is as we expect — the pion cannot decay into any particle 
more massive than itself, such as a 7-lepton. 


2. Much more surprisingly, if mg were to vanish, then the pion could not 
decay either! 


'This last situation is a bit counter intuitive, and merits a bit more inves- 
tigation. Computing the ratio between the decays into the electron channel 
and the muon channel, we find 


2 
2 m? 
2 NL: ( - #4) = 1.28 x 1074 (21.24) 
- E =l. E 
T(m — u TE) mi(i-2 


T (a7 — e + 


m 


which means that the c^ is about 10,000 times more likely to decay into a 
muon than an electron! The experimental value for this ratio is (1.230 + 0.004) x 
107* [1]. Why is this, when phase-space considerations make the lightest par- 
ticle the most probable one to decay into? 

'The reason is that the pion is spinless, so whenever it decays it must emit 
particles that spin in opposite directions. Since the antineutrino is always 
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o => angular - 
momentum 
T 
© weak 
V, y 
FIGURE 21.4 


Final state spins in pion decay as respectively dictated by angular momentum 
conservation and the weak interaction. 


right-handed, this means that the lepton /^ must also come out in a right- 
handed state since it is moving in the opposite direction, as shown in the 
top part of figure 21.4. However, if the mass of the lepton were zero, then 
weak interactions would dictate that the lepton must come out left-handed, in 
violation of angular momentum conservation. The only way to reverse the spin 
is through the lepton's mass, which couples its left-handed and right-handed 
parts. The smaller the lepton mass, the more suppressed this spin-reversal is, 
and so the decay is heavily suppressed for the lighter electron as compared to 
the heavier muon. 


21.3 Quark and Lepton Vertices 


We are now ready to collect together our knowledge of the charged weak 
interactions, putting together in a coherent whole what we know about leptons 
and quarks. 

Let's begin with the leptons. While we know that electrons can couple to 
neutrinos via a W-boson, we have never observed 


LU —W--tw vw—W*-ctr e —W--v, etc. (21.25) 
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This indicates that the lepton/W-boson vertex of the weak interactions, shown 
in fig. 21.5, always couples leptons within a generation: 


e —W tw p —W ctv, T —W + (21.26) 


but never across generations. We refer to this state of affairs as the conser- 


V, V, 
ud ia 
^ ^ 
7 ^ 
P Or 
g p 
FIGURE 21.5 


The charged leptonic weak vertex 


vation of lepton number: “electron-ness” is always conserved in any physical 
interaction (as is “muon-ness” and “tau-ness” ). 

The weak interaction symmetry group (as we will see) is SU(2). This 
symmetry group performs a role in the weak interactions that is analogous to 
the one that SU¿(3) performs for the strong interactions, though with some 
non-trivial subtle features as we shall see. All known lepton wavefunctions 
transform as irreducible representations of this SU(2) group. 

Experiment also indicates that all neutrinos are left-handed. A simple in- 
terpretation of this from a group-theory viewpoint is that the right-handed 
leptons (ie. the right-handed parts of the electron, muon and tau wave- 
functions) are singlets in this group. Physically this means that they don’t 
couple to charged weak bosons. The left-handed leptons (all neutrinos, plus 
the left-handed parts of the electron, muon and tau wavefunctions) group into 
irreducible doublet representations of SU(2): 


colo), ms), (E), mm 
L L L 
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and from eq. (20.5) the Dirac wavefunctions are Y, = z(u — 7) v. It’s 
common in writing the expressions for weak doublets to just use the names 
of the leptons and quarks, and I will continue this practice. But keep in mind 
each entry is really a Dirac wavefunction. 

So a “spin-up” €,-particle is an electron-neutrino; a “spin-down” €r- 
particle is a tau lepton. In other words, what we generally regard as two 
distinct particles, the weak interactions regard as two states of a single parti- 
cle! 

This feature of the weak interactions takes some getting used to. However, 
we have seen something like it before in both electromagnetism and the strong 
interactions. In electromagnetism we have a vertex corresponding to the 
process e — e +y. Since the photon is spin-1, angular momentum 
conservation forces the electron to undergo a spin-flip in this process, for 
example e- 1) — e-(0 +y. We normally don't regard the spin-up electron 
as a distinct particle from the spin-down electron — instead we regard both 
as different states of one particle. In QCD there is a vertex in which a 
quark emits a gluon — for example u — u + g. Again angular momentum 
conservation — and also color symmetry — imply that the quark must undergo 
a spin-flip and that its color must change. So the vertex is really something 
like uP) — uP) 4 gRP — again, we regard u®\ and uP(U as different states 
of the up quark, and not different particles. 

From the perspective of the weak interactions, a lepton and its neutrino 
partner are likewise different states of the same particle. Historically we have 
regarded them as different particles because their masses and charges differ. 
However, as far as the charged weak interactions are concerned, they are 
different states of one particle, with vertices given in figure 21.6. For example, 


€ 

W ^ Yn ap 

P Put "d 
Y ee 
u^ 

€ 

i » 

FIGURE 21.6 


Generalized weak vertices for the electron, muon and tau leptons 


a left-handed Tzr wavefunction, say, emits a W ^ boson and continues on its 
way. In the process it flips from a 7” state to a v, state, analogous to the 
way a spin- 3 particle flips its spin when it emits a photon, or a quark changes 
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color when it emits a gluon. 


409 


What about the quarks? Since we know that the process underlying neutron 


decay 


n pce Ve 


d ute Ve 


(21.28) 


(21.29) 


as shown in figure 21.7 we might also guess that left-handed quarks group 


proton | u du = = 


neutron] udd 


FIGURE 21.7 
Neutron decay in terms of quarks 


themselves into irreducible doublet representations of SU (2): 


u c t ) 
A : (21.30) 
( d ) L ( s ) L ( bJ; 
However, experiment tells us this is wrong because we also observe 
A — pte cV. (21.31) 
which corresponds to 
s— u+e tUe (21.32) 


as is easily seen by looking at the quark content on both sides of this reaction. 


We also know that 
B- — n? +e +7. 
which in quark terms is 
b—ute +h 


(21.33) 


(21.34) 


and that there are many other flavor-changing decays. If this were not the 
case, then we would have three more conservation laws: conservation of 
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“up/down-ness,” “charm/strange-ness” and “top/bottom-ness.” This would 
mean that the lightest strange meson (the K7), the lightest strange baryon 
(the A) and the lightest B-meson would be stable and never decay. We don't 
observe this — so somehow we must take the crossover between generations 
into account. 

We will proceed in stages, following the historical path that led to our 
current understanding. Some terminology will be helpful here. In general 
there are three kinds of weak interaction processes for hadrons: 


1. Leptonic: all decay products are leptons ( e.g. *^ — £^ +Dp) 


2. Semi-leptonic: some decay products are leptons ( e.g. X9 — EF +17 + 
Te) 


3. Non-leptonic: no decay products are leptons ( e.g. A — p + 17) 


These are progressively harder to analyze because of the progressively great- 
er importance of the strong interactions in each case. The leptonic decays are 
the easiest to understand because strong interaction effects are minimized. 
Hence to understand cross-over between generations, the simplest thing to 
look at is the leptonic decay of the Kaon. This is the process Nicola Cabbibo 
concentrated on when he wanted to understand the structure of weak hadronic 
decays. 

In 1963 Cabbibo suggested [211] (shortly after the 3-quark model had been 
proposed) that quark charged weak-vertices were of the form given in figure 
21.8 This makes the u/d/W vertex just like the ve/e/W vertex except for the 
constant factor of cos 0e; likewise the u/s/W vertex is just like the v./e/W 
vertex except for the sin ĝe factor. If 0, were zero, then the weak interactions 
would respect conservation of quark generations. 

Experimentally, the decays of Kaons and neutrons together imply [1] 


6. = 13.04? (21.35) 


and we now call this angle the Cabbibo angle. This small value of ĝe indicates 
that, empirically, we almost have conservation of quark generations — but not 
quite! To see how this works, consider the decay 


KS ys (21.36) 


where (^ = e- or y”. The diagram 21.9 is just like pion decay (fig. 21.3), so 
we obtain 


M - (4 (E) fap (1-45) ote] fap, (21.37) 


as the matrix element. All that is different is that fx replaces fr. Hence 


the decay rate is 
4 2 
pue EN qe ya omg (21.38) 
2567h \ Mw) m3. mi. j 
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FIGURE 21.8 
Cabbibo’s proposed W-boson/quark vertices. 
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The strong coupling strength binding s and ù into a Kaon is presumably the 
same as that which binds d and u into a pion. The only difference should be 
in the weak couplings: cos 6, for the pion and sin 0, for the Kaon. Hence we 


expect 
fK u sin 0. 


== = tan ĝe 
Tx cos ĝe x 


yielding 


212 
DK- — ev) aao ma 7 E) 
T(m — 5 +m) “mr (E 


which for 0, = 13.04? gives 


T(K- — 74») [49 tor =e7 
Tír- —4£--vj) 1:96 for£ = 


and compared to experiment 


pexpt (K- —£ cw. [38 fri E 
Texpt (r= — £- + vj) ~ |134for£ = g^ 


(21.39) 


(21.41) 


(21.42) 
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FIGURE 21.9 
Decay of a Kaon into a lepton and the associated antineutrino. 


we see that there is a crude agreement. The mismatch between the theoret- 
ical and experimental values is due to strong interaction effects, which are 
reasonably well understood [212]. 


21.4 The GIM Mechanism 


Cabbibo’s model worked reasonably well for many strangeness-changing de- 
cays. But it also predicted the decay K? — q^ + p* via the diagram* 
given in fig. 21.10. This seems harmless enough — experiment indicated that 
the branching ratio was 7.3 x 1079 (today this is known more precisely to be 
7.18 + 0.17 x 107? [213]) - but the problem was that Cabbibo's theory pre- 
dicted from the “box” diagram in fig. 21.10 a known quantity T (pi! : m2) from 
a well-determined (and finite) loop integral mulitplied by sin ĝe cos ĝe. This 
theoretical result yielded a quantity much larger than what was observed. 


*There is another diagram with the internal W's crossed over that I have left out. 
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FIGURE 21.10 
Expected lowest order diagram K? —> utu decay. 


More generally there were some other strange features in Kaon decays that 
were not satisfactorily explained by the Cabbibo theory. For example the 
decays 


K* — rt + ve + De K9? — m? tretet (21.43) 


might be expected to proceed with equal rates — after all, there is a similar 
change in strangeness for each. The only difference between them is that the 
lepton charge in the final state is neutral for the K+ decay (a so-called neutral 
current process). Yet experiment indicated that 


BR(K* — at + ve + De) = 1.5 1.3 x 107" 
BR (K° — 7° + ve + e™) = 4.98 + 0.07 x 107? (21.44) 


a difference of many orders of magnitudet. Evidently strangeness-changing 
processes prefer charged leptons in the final state and strangeness-changing 
neutral currents are suppressed — but why? 

In 1970 Glashow, Iliopoulos and Maiani (GIM) proposed that the solution 
to these problems was the existence of a new quark that was a counterpart of 
the up quark [172]. It had the same charge, but orthogonal couplings to the 
down and strange quarks, as shown in fig. 21.11. If this new quark exists, and 


TT've used present-day experimental values [1], but the problem was clear when these mea- 
surements were much less accurate. 
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FIGURE 21.11 
The GIM vertices. 


if its couplings are as in fig. 21.11, then the actual lowest-order diagrams for 
K? — u` + p* are given in fig. 21.12. These diagrams thus add as follows 


2 

M = sin 6, cos eT (pl; mi) — cos 0. sin 6,1 (p; m2) ~ TAF (pt) (21.45) 
where the p^ are the external momenta in the problem. Both diagrams give 
the same integral with m2 replaced with m2, and so (because of the orthogonal 
couplings in fig. 21.11) the integrals cancel up to a factor of “4 my, Setting the 
observed rate equal to the theoretical value yields m, ~ 1.5 Gov. 

GIM thought that this would be a charming solution to the puzzle of K? — 
uT + yt, and so they named the new quark the charm quark. Nearly four 
years later it was discovered [169, 170], and had just the mass predicted by 
the GIM mechanism! 

So in the GIM-Cabbibo scheme, the “correct” quarks to use in the charged 
weak interactions are not d and s, but rather 


u (wu d c fe 
d' EG dcos, + ssin0c } , an s! EG RR e 


46) 
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FIGURE 21.12 
Lowest order diagrams for K? — u* + p”. 


In other words, the “elementary particles" of the electromagnetic and strong 
interactions are d and s, but the “elementary particles" of the weak interac- 
tions are d' and s', where 


i si 
a _ Gay ae (2) (21.47) 

8 y — sin ĝe cos O. 8 J 
and the matrix transforming between the two is called the Cabbibo matrix. 
This mechanism suppresses the puzzling strangeness-changing neutral current 
processes noted above. Since the right-handed parts of the quark wavefunc- 


tions don’t experience the weak interaction, we can make the above redefini- 
tion for these parts too. 


21.5 The CKM Matrix 


Before charm was discovered in the “November Revolution” of 1974, Makoto 
Kobayashi and Toshihide Maskawa turned their attention to the problem of 
CP violation in Kaon decay in 1973 [214]. They wanted to see if a GIM-type 
mechanism could be used to explain it. 

To do this, they needed a complex number in the Cabbibo matrix. However, 
if you put one in, you can always remove it by redefining the phases of the 
down and strange wavefunctions. This is because the most general relationship 
between the weak (primed) eigenstates and the mass (unprimed) eigenstates 
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is given by a 2 x 2 unitary matrix V, whose most general form is 


d! d eiti cosh et(¢1+492) sin 9 d 
(2) pii (5) E [os sin 8 ei (9293) cos g á (21.48) 
which has 4 parameters, as expected for a unitary matrix. You can easily 


check (or deduce!) that this matrix is unitary. However, the 4 parameters are 
not all physically relevant, which can be seen by writing the above as 


d! et% 0 650” "din 10 d 
(2) i E cis) ae ag f cis) (2) (21.49) 


Redefining the quark wavefunctions so that 


0-10 aso 
Geh em 


we see that the only relevant physical parameter is O — the Cabbibo angle! 

In a 6-quark model this strategy does not work because there are not enough 
quark wavefunctions to absorb all of the phases in a 3 x 3 unitary matrix. So 
their idea was to consider a 6-quark model in which the charge (-3) quarks 
that couple to the W-bosons are related to their strong/electromagnetic coun- 
terparts via 


d' Vua Vus Vut d 
s | = | Vea Ves Va | | 8 (21.52) 
bj: Via Vis Vi BJ 


so that the quark weak-doublet structure is 


t 
2 (2), (9), cm 


with diagrams as shown in fig. 21.13. 

From this perspective the weak interactions conserve up/down'-ness, charm/ 
strange'-ness, and top/bottom'-ness! Somewhat analgous to the leptonic case, 
a “spin-up” 6 ¡-particle is a charm quark; a “spin-down” 8,-particle is a b 
quark, which is a mixture of the d, s, and b quarks (and not just a b quark). 

'The matrix relating primed quark wavefunctions to the unprimed ones is a 
unitary matrix known as the CKM matrix (for Cabbibo-Kobayashi-Maskawa). 
It is a 3 x 3 generalization of the Cabbibo matrix. By redefining quark wave- 
function phases, it can be written in the standard form 


Vua Vus Vub Cy $1C3 4 $153 f 
Y= Ved Ves Veb = —8$1C2 C1C2C3 — 8253619 C1C253 + $3c3 6/9 (21.54) 
Via Vis Vib — 5152 €182€3 + C283" C1 $283 — C2c3e"° 
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FIGURE 21.13 
Generalized weak vertices for the down-type, strange-type and bottom-type 
quarks. 


where c; = cos 0;, s; = sin 6;. 

Suppose we set 02 = 03 = 0. It’s easy to see from eq. (21.54) that the 
3rd generation doesn’t mix with the other two, and we recover the original 
Cabbibo matrix, with 6, = 0e. Empirically we do observe mixing of the 
third generation with the other two (in decays of B°-mesons; recall B^ — 
n? + e` + T,), but it is small, as we infer empirically from the relatively 
long lifetime of the B meson and the small branching ratio of decays into 
uncharmed mesons. 

The parametrization of the CKM matrix given in eq. (21.54) is the original 
one used by Kobayashi and Maskawa. It is more common now to use Euler 
angles (012,023,013) and one CP-violating phase (9) [1], in which case 

C12013 $12013 size 
V = | —s12023 — c128938130% C12C23 — $198235130% 823013 (21.55) 
812823 — €12€23813€" —C12893 — 512€23513€/) co3C13 


where again ci; = cos6;; and sj; = sin ij. This form of the CKM matrix 
makes it easy to see how the different generations couple to each other, since 
couplings between quark generations 7 and j vanish if 0;; = 0. 

The values of the four angles in the CKM matrix are constants of nature, 
just the way that the coupling constant of electromagnetism, Planck’s con- 
stant, and the speed of light are constants of nature. These angles tell us how 
strongly the different quark generations couple to each other. They must be 
empirically measured. Experimentally, we know only the magnitudes of the 
matrix values, which are [1] 


Vial [Vis] [Vuo] 
|V| = | [Vea] [Ves] [Vel 
[Veal |Ves| [Veo] 


.97383 + 0.00024 .2272 + 0.0010 .00396 + 0.00009 
.2271 + 0.0010 .97296 + 0.00024 .04221 + 0.00045 (21.56) 
.00814 + 0.00048 .04161 + 0.00012 .999100 + 0.000034 
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We see that, empirically, the third generation hardly mixes at all with the 
other two — 512 >> 893 >> 813. This is why Cabbibo’s original idea worked 
so well. 

'The Standard model does not explain the origin of the values of the entries 
in the CKM matrix. So far there have been no successful extensions of the 
Standard model that have offered a convincing rationale for why they have the 
values they do. At this point in time the best we can do is to experimentally 
check these values for consistency. This is done via something called the 
unitarity triangle, which exploits the fact that the CKM matrix is unitary, 
ie. VIV = 1. For example, the third row and first column entry gives 
(VV), =0, or 

Vud Vun + Vea Ves + Via Ve = 0 (21.57) 
which is a sum of three complex numbers that add up to zero. These numbers 
can be drawn as vectors in the complex plane, and since they sum up to zero 
they form a closed triangle, as shown in fig. 21.14. The relative angles between 
two sides are the arguments of the ratios of these complex numbers 


a = arg (- Vea Vas ) B — arg (- ce ^y = arg (- m) 
Vua Vip Via Vo Voa V, 
(21.58) 
and since it's a triangle we must have a + B + y = 180°. Other unitarity 


(p.m) 


(1.0) 


FIGURE 21.14 
The unitarity triangle. Image courtesy of the Particle Data Group [1]. 


triangles can be defined from the CKM matrix, but the one in fig. 21.14 is 
the most commonly used since its matrix elements are the best known. 
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'The areas of all unitarity triangles are the same, and have half the value of 
the Jarlskog invariant jj, defined as [215] 


Im (Vig Vin Vit Vi) = Id EikmEjin (21.59) 


m,n 


One computes the Jarlskog invariant by simply choosing particular compo- 
nents of the CKM matrix; for example, J = Im (Vaa Ves Vis V¿q). The unitarity 
of the CKM matrix ensures that the value is numerically the same no matter 
what the choice. It is a phase-independent measure of CP-violation, and has 


the experimental value [1] 


3 = (3.08 + 0.17) x 107? (21.60) 


One of the important goals of modern particle physics is to overconstrain 
the CKM matrix. Finding that its parameters do not yield unitarity triangles 
would be a clear signal of physics beyond the Standard Model. 


21.6 Questions 


1. Calculate the decay rate for T^ — 17 +v,. Using the measured value 
for fr and the lifetime of the 7~, what is the branching ratio for this 
decay? How does it compare to experiment? 


2. Find the branching ratio for the decays T —> a” + v, and T ^ — 
K- +0. 


3. In the decay 7” —> u^ + v,, suppose that the weak coupling vertex 
factor is 2521" (1 — e75). 


(a) Calculate the spin-averaged and spin-summed matrix element for 
this process. 


(b) For what value of e is the ratio 


T(r- — pu + £,) 


IT (r7 — e + De) 
maximal? 
(c) How might you determine e from experiment? 
4. Consider the decays D; — 7 +, and D; — u^ + Dp. The decay 
rates can can computed in terms of a decay constant fp,. 
(a) What is the ratio of the decay rates? 


(b) Calculate the expected branching ratio for the decay into 7^ + 7, 
if /p,— 280 MeV. 
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5. For the process in figure 21.15 show that 


FIGURE 21.15 
Neutron decay diagram. 


Y [aria (1 — e") u()]* [ayy (1 — e?) u(p1)] 


spins 
= 4[(1+ ê) (ppt + pip — g” (pı p1)) 
-2iee Pp apa + (1 = e) ma mpg" | 


6. Suppose that the muon neutrino has a mass m. 


(a) Compute the ratio R, where 


T (rt — et + ve) 


R= 


Part pF ty) 
(b) For what value of m is this quantity maximized? For what value 
m of is it minimized? 


(c) Could the experimental value of R be used to determine an empirical 
value for m? Why or why not? 


7. Show that for the momenta defined in figure 21.15 


1 
(p2: p1) = 2 (mz, — m2 — mp) — ma E; 
1 
(p2 : p1) = 5 (mz + me — mp) — mnEz 
1 
(po : p2) = 5 (m2 — m? — m2) + mn E, + ms Es 
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8. Beginning with the expression for the matrix element |M| for the decay 
of the neutron 
2 ( gw y 2 l; o 2 2 
1 / 
= (4) |a + €)" Mp Es G (mé, mz m2) ma E5) 


1 
Y m, Ez G (CA ma Ea) 


I (m2 m2 — m2) 


| mn Ez + 253] 


given in eq. (21.6), find the decay rate of the neutron in terms of the 


parameters 
ee Mn — Mp 5= Me 
Mn Mn 
E = 
2 $= Vm- = [pa] 
Mn 


to lowest order in these parameters. Insert the appropriate factors of c. 
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Electroweak Unification 
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Electroweak theory — sometimes referred to as the Glashow-Salam-Weinberg 
model — is a model that unifies weak and electromagnetic interactions. In other 
words, from the perspective of the Standard Model these two forces are part 
of a more unified whole. The first paper on unification of electromagnetic and 
weak interactions by Glashow in 1961 [216] required such neutral interactions, 
and in 1968 Glashow’s model was modified by Weinberg and Salam [217] to 
include a physical mechanism that made the force carriers (the weak vector 
bosons W and Z) massive. In 1971 Gerard 't Hooft and Martinus Veltman 
showed that this model is renormalizable [218] (with Ben Lee carrying out 
similar work independently [219]), and in 1983 the W and Z particles were 
discovered [73]. All subsequent particle physics experiments have confirmed 
this theory. 

To properly lay out the basic structure of this theory we must first turn 
our attention of an aspect of the weak interactions that I have so far ignored, 
namely neutral currents. The Z boson is the force-carrier of the weak neutral 
force. Let’s begin by looking at its properties. 


22.1 Neutral Currents 


Neutral currents were first postulated by Bludman in 1958 [220], shortly after 
the W-boson was proposed as the force carrier underlying the weak inter- 
actions. He suggested that the W-boson had an electrically neutral partner 
called the Z. Its basic interaction, shown in fig. 22.1, indicates that the Z 
couples to any fermion f. Note that the identity of f is preserved in the 
interaction. There is no vertex for which e” — Z -u or s — Z +t, 
for example. Such processes would not be consistent with low-energy meson 
decays, such as K? — p~ ++. This is what made neutral currents so dif- 
ficult to observe — they would be swamped by electromagnetic effects, which 
are also neutral currents: the photon is electrically neutral!. For this reason 
many people doubted that Bludman's postulate was correct. 

'The first evidence experimentally for weak neutral currents came in 1973 at 
CERN in the Gargamelle bubble chamber, shown in fig. 22.2. The searches 
for neutral currents in previous neutrino experiments had resulted in discour- 
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FIGURE 22.1 
Bludman's general proposal for the coupling of the Z to fermions. 


agingly low limits, and by the late 1960s it was commonly assumed that no 
weak neutral currents existed. People were more excited about the possibil- 
ity of proton substructure. This provoked the question as to what structure 
would be revealed by the W in neutrino experiments, analogous to what the 
photon revealed in deep-inelastic electron-proton scattering. 

André Lagarrigue, André Rousset and Paul Musset worked out a proposal 
for a neutrino experiment that aimed to increase the event rate by an order of 
magnitude, which meant building a large heavy-liquid bubble chamber. The 
key challenge for the experiment was to deal with the unavoidable background 
of events in which a charged hadron leaves the visible volume of the cham- 
ber without visible interaction. This “fakes” a muon event. Events with a 
muon candidate were collected in one category, A, while events consisting of 
secondaries that were all identified as hadrons were collected in a second cat- 
egory, B. Category-B events (referred to as neutron stars (n*) — don't confuse 
them with the astrophysical objects) were thought to arise when undetected 
upstream neutrino interactions emitted a neutron that interacted in the cham- 
ber. It was then easy to deduce from these events the fraction that did not 
interact, thus simulating a muon, and to subtract them from the observed 
number of events in category A. 

If weak neutral currents indeed existed, they would have induced events 
consisting of hadrons only, just as the n*s did, and they would be waiting to 
be discovered as part of category B. The main task was then to find ways of 
distinguishing neutrino-induced events from neutron-induced events. A neu- 
trino beam was formed from decaying pions (T^ — u^ +7, recall). The 
muons were filtered out by a thick iron shield and the neutrino reactions were 
observed in a bubble chamber. The purpose of Gargamelle was to “see neutri- 
nos" by making visible any charged particles set in motion by the interaction 
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FIGURE 22.2 
The Gargamelle heavy-liquid bubble chamber, installed into the magnet coils, 
at CERN in 1970 (copyright CERN; used with permission). 


of neutrinos in the liquid. Neutrinos interact very rarely (recall eq. (19.20)), 
so Gargamelle was designed not only to be as big as possible, but also to work 
with a dense liquid - Freon (CF3Br) - in which neutrinos would be more likely 
to interact. The final chamber was a cylinder, 4.8 m long and 1.85 m wide, 
with a volume of 12 cubic meters. 

Events of the form 


Vua +N — vp +X (22.1) 
D,+N v, +X (22.2) 


where N is a nucleon and X is a hadron were observed [221]. Since there is 
no charged lepton in the final state, such interactions must be due to a weak 
neutral current. By 1973 the number of neutral-current candidates was en- 
couragingly large, as table 22.1 indicates. Spatial distributions of the events 


TABLE 22.1 
Neutral Current Candidates from the Gargamelle 
Experiment 
v-exposure D-exposure 
# of Charged current candidates 102 64 


# of Charged current candidates 428 148 
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suggested that the vertex distribution of the neutral-current candidates is 
neutrino-like, since they were flat like the charged-current events. Further- 
more, there was no indication of an exponentially falling distribution at the 
beginning of the chamber, which is what one would expect if the neutral- 
current candidates were dominantly induced by neutrons. These arguments 
were both corroborated by Monte Carlo simulations. A careful check of the 
neutron-induced background (that took several months to complete) indicated 
that only a small fraction of the neutral-current candidates could be explained 
by neutron-induced events. An independent check exploited the spatial distri- 
butions of neutral current and charged-current candidates, providing further 
evidence that the neutral-current sample was not dominated by n*s. 


'The data indicated that the branching ratios were 


v (v, +N — vy + X) o (V,- N — T, +X) 


—— 2.25 and ~ 45 
o (+N pr +X) c (z, +N — u- +X) 


(22.3) 
which meant that weak neutral-current reactions (due to Z exchange) were 
comparable to weak charged-current interactions (due to W= exchange). This 
meant that they could not be explained as a higher order effect. There was 
also evidence for v, +e — v,-Fe at a similar rate: a purely leptonic 
neutral current event! 

This was good news to theorists, who had years earlier postulated such 
weak neutral currents and used them to unify weak interactions with elec- 
tromagnetism, most notably in what is sometimes called the Glashow-Salam- 
Weinberg model. Together with QCD, this model forms what has come to be 
called the Standard Model. 


22.2 Electroweak Neutral Scattering Processes 


The coupling of fermions to Z's is somewhat more complicated than to W’s. 
The basic vertex is given in fig. 22.3 and contains both vector and axial-vector 
couplings. These depend on the species of fermion f; the values of cl, and ch 
appear in table 22.2. The rather haphazard-looking structure is a consequence 
of electroweak unification as we see in Chapter 23. 

Note that all the couplings depend on another angle, 0w, called the weak 
mixing angle. This angle also relates the weak and electromagnetic coupling 
constants: 


Je 
sin Ow 


gw sinOw=ge=e and gzcosÜw = gw = (22.4) 
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FIGURE 22.3 
The fermion Z-vertex. 
TABLE 22.2 
Neutral Current Couplings to 
Fermions 
j ch 
Ve, Vu, Vr i 
Esu st -i +2sin? Ow 
l 49:2 
u,c,t z — 3 sin® Ow 
1,2442 
d,s,b —3 + 5 sin” Ow 


427 


As with the CKM matrix, the value of the weak angle is unexplained in the 
Standard Model. It must be determined from experiment, which indicates [1] 


sin? Ow = 0.23119 + .00014 => 


OCXP* — 28,740 


(22.5) 


'The Z propagator is similar to that of the W, as shown in fig. 22.4 where 
the masses of the W and Z are related to one another via 


Mz cos Oy = Mw 


(22.6) 


The predictions for the values of gw, gz, Mw, and Mz in terms of the weak 
angle 0w and the electric charge coupling e follow from the electroweak theory 


of Glashow, Salam and Weinberg. 
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FIGURE 22.4 
The Z propagator. 


22.2.1 Neutrino-Electron Neutral Current Scattering 
Before we get to that, we can see how the weak neutral current mediates 


scattering processes since we now have enough information to compute them, 
say v, +e — vy +e . The diagram is given in fig. 22.5 and the matrix 


FIGURE 22.5 
Lowest-order electron neutrino scattering. 


element is 


22 (9zV2 f. 1 1 Juv — IQ / M2 
M = (-if ey (Z) faon" G " 31) u(ps) E =P F iMsTz 
Z 


x [u(p3)" (1 — ey?) u(ps)] (22.7) 
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where e = á, The procedure is completely analogous to that for neutron 
V 


decay 
5 [a)y (1 — e7) u(»2)]" [Eph (1 — ey”) u(p»)] 
spins 
= Tr |» (1 — ey?) (p) y” (1— ey?) (| 
+meTr [y" (1 - e) Y (1-e7")] 
= 4 [(1 +è) (pops + pope’ — gt” (p2 py) + ice?” papa] 
Am? (1 — €?) gi” (22.8) 
and so 


"EV. 4 
——2 c g 
wat = EL ($2) [it 0) o i (on) + = e n ph) (Ha) 
- (1 - ê) mà (ps -34)] (22.9) 
just as before. Neglecting the electron mass, in the CM frame we have 


pi =(E,p) py =(E,—p) and pü-—(Eg) př =(E,-p”) 
where |p? = |p|? = E? and — p-p! = E? cosh (22.10) 


and so obtain 


(pi - pa) = (pi - pa) =2E? 


0 
(pi ph) = (pi -p2) = E? (1 + cos 0) = 2E? cos? 3 (22.11) 
yielding 
— i 0 
IM] ~ 2B (y (22) luxe? (1-0? cost = (22.12) 
Mz 2 


The differential cross-section is 


do _ i IM]? 


dQ 87) (2E) 
En Gy (&y ES ates (1:93 cost d (22.13) 


and integrating over angles yields 


2,642 4 
A ime) ei) ( gz ) [164-2] (22.14) 
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Let's compare this to the result for v, +e” —9 Ve + pp, which we found from 
eq. (20.15) to be 


where m, is the mass of the muon. Neglecting this and integrating over the 


angles gives 
: ( ficgw E (22.15) 
Ww 


Putting it all together we find that 


o (Uy Y el — vy te) 


o (Va + e7 — vet u^) 


(noe)? (ep)? (ri) ter 


2 
247 i (xs) E? 
W 
4 
gz Mw e 4,2 e e e X2 
= (2 xz) [et + (eb) (ea) + (ea) 
W 


| 


Ble wle con 


1 "OW ee ee AN 1\? 
[en 0) ES (-5 250 0%) + (-5) 


4 
— sin? Ow + 3 sinf 0w 


0.09 Theory 
E Tos Expt (22.16) 
The experimental value is 0.08 [222], good to within 10%! 

The reason why it took about 15 years before experiment could confirm 
Bludman’s idea is that at low energies neutral current processes compete with 
electromagnetic ones. An example is given by electron-positron annihilaton 
into some other fermion-antifermion pair, as shown in fig. 22.6. Since the mass 
of the Z is so heavy, the neutral current interaction is very feeble compared 
to the electromagnetic one. That’s why neutrino beams were needed to see 
weak neutral currents — neutrinos don’t couple to photons! 


22.2.2  Electron-Positron Neutral Current Scattering 


Let's consider electron-positron scattering into a fermion-antifermion pair. 
The lowest-order diagrams in electroweak theory are given in fig. 22.6. The 
fermion can be anything except for another electron, since in that case we’d 
have to add two other diagrams. The matrix element for the Z diagram in 
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FIGURE 22.6 
Electron-positron annihilation to some other fermion-antifermion pair to low- 
est order in electroweak theory. 


22.6 is 


` e g " — e Juw —d qu /MZ 
M = Cà eve E [v(p)o" (1 — eq?) u()] q? - M2 + ¡MZ 


x [u(p5)y" (1 — ey?) v(p1)] (22.17) 
where q? = (p2 + pi). We want to work at energies comparable to the mass 
of the Z, and so the modification of the propagator to the Breit-Wigner form 


will be important here, and we will no longer be able to neglect the q,, terms. 
However, note that 


v(p2)»" (1 — e*3?) u(p1)a, = 2m. [v(p2) (ey?) u(p1)] (22.18) 
which you can show using (y, — me) u(pı) = 0 and v(p2) (s, + me) =0. 
Similiarly 

[alpy (1 — 7°) v(p)] a, = 2m; [ulpa (5?) v(PL)] (22-19) 


So the q, terms contribute factors proportional to the electron and fermion 
masses. These are negligible relative to the mass of the Z and so we neglect 
them. 


Hence 


T 2 e 1 
M = Ci eel (E) Bloa)" (1 — e) u(ps)] £-MITiMg 
Z 


x [u(po) yu (1 — ely?) v(p1)] (22.20) 
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and the Casimir trick implies 


€ A 2 
M] = 1 Gel, (5) 
4 la? — M2 +1MzTz| 


2 


Tr |» (1 — e) Pr (1 — ey) A 


xT [yu (1 - 6959) «(0 - 455) 9 
2 
1 CEA 


2 (q? — My + (MzLz)? 


x {(1 + (e?) (1 + (y) [(p1 * P1) (p2 * po) + (pa - P2) (Pi > pa)] 
+4e e! [(p, - p1) (pa p2) — (m1 75) (03: »2))) (22.21) 
where in the CM frame (22.11) 


" CLE TN ((1 + (ey) (1 + (y) (1 + cos? 0) — 8e*e! cos 9) 


|M] = 2 
(2E)? - M3) MD? 


(22.22) 
with 0 the scattering angle. The differential cross section is therefore 


do En (setae). (1?) (1+ e") eost t — sete! cos0) 


2 
dQ 167 (Gy E M) ge (MzLz)? 
(22.23) 
which integrates to a total cross-section of 
2 2 
ey? e 2 f f 
(heg?, E)? [c2 + (cA) | [o t (<4) | 
(22.24) 
48T 


2 2 2 
(2E) » M3) +(MzI'z) 
When mediated by a photon, the cross-section for the same process is, from 
eq. (17.22) 


h 2 2 f 2 
om ee (22.25) 


where Q is the charge of the fermion in units of e. So the ratio is 
az _ a(ete — Z — ff) 
y aleteo 7 fi) 


[4 — 2sin? Ow + 4sinf Ow] (t) i T (ch) | 


(sin? Ow cos? ówQ1)? 
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4 
x E (22.26) 


(Gy E (Mz) + (iz Mzcày? 


inserting the appropriate factors of h and c. 
At energies much less than the mass of the Z (but still much greater than 
the fermion masses) we have 


EV 2 
y [5 — 2 sin? Ow + 4sin4 Ow] [o + (c) | ( E an 
T (sin? Ow cos? Ow Qf) 5 Mze 


and the electromagnetic interaction dominates, since even for E — iM. zc, 


4 
we have (miz) ~ 0.3%. However, when the electron and positron collide 


at exactly the rest-mass of the Z (i.e. at E = ¿Mzc?), we have 


1 2 2 

37 2sin%0 4sin'6 (ct) e M 
e E sin” Ow + Asin w] Cy) ct (CA ( a 
Ty 16 (sin? Ow cos? ówQt)? 


aE ) (22.28) 


which shows that the neutral weak interaction dominates because ic ¡4 ) 
84. 


22.3 The SU(2) x U(1) Model 


The general form of the vertex for fermions coupling to W's is one we looked 
at in the previous chapter and is shown in fig. 22.7 where the wavefunction 
x can be any one of 


V V V. 
€é,=( €), 9*m;-[")]|,8:-[| 7 22.29 
a K ), i D ), 1 E jJ 
t 
Sue ) e, =(%) ».-( ) (22.30) 
d' " s' T y ES 


where in the above ve = V^, po = pe , etc and we recall that wr = 
1 (1-75) v (and Yr = (4 (1— 7°) v) ? = v(1(14-45))). The primed 
quark wavefunctions are related to the unprimed ones familiar from QED and 
QCD via the CKM matrix as in eq. (21.52). What is needed to describe this 
is an underlying theory that accounts for this vertex as well as for the vertex 
for the Z boson in fig. 22.3. 
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FIGURE 22.7 
The general form of the W-boson/fermion vertex. 


However, we will need more - since the WF bosons are charged, they will 
need to couple to the photon — in other words, there should be a WW 
vertex. This suggests that we need a Yang-Mills theory as our underlying 
theory because (as we have seen with the gluons) it provides for vertices with 
3 gauge particles. 

A number of attempts were made to describe the weak interactions using a 
Yang-Mills theory once it became clear from experiments on parity violation 
in the 1950s that the weak nuclear force was quite different from the strong 
one. Many models were proposed over the following two decades until ex- 
periment finally winnowed out all but the one theory we have today, namely 
the electroweak theory of Glashow, Salam and Weinberg. This theory has a 
non-abelian symmetry SU(2) & U(1), with 3 gauge bosons 


(Wi, Wa, Ws) = W, 


associated with the SU(2) and 1 gauge boson called B, that is associated 
with the U(1). Physically the non-abelian symmetry means that, analogous 
to gluon interactions, the W and B bosons will interact with each other as 
well as with the quarks and leptons, resulting in additional vertex rules for 
electroweak theory. 

But the symmetry is “broken” — the lowest-energy state of the theory does 
not reflect the full symmetry of the theory. We will see that the two neutral 
bosons — the Wj; and the B, — mix together to form a massless boson (the 
photon) and a very heavy boson (the Z). The remaining W bosons will 
combine into the charged W= bosons. But how? And why do the W; and B, 
boson wavefunctions get mixed up into the Z-boson and the photon? And 
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why are the W-bosons and Z-boson so heavy, but the photon massless? You'll 
have to go to the next chapter to find out. 


22.4 Questions 


1. Compute the decay widths Tw and Iz, neglecting fermion masses. 


2. (a) Compute the decay rate Z — f + f, where f is any quark or lepton 
lighter than the Z. Neglect the masses of the fermions. 


(b) Find the branching ratio for each species of quark and lepton, as- 
suming that these are the dominant decay modes. 


3. Compute the ratio 94 of quark pair production to muon pair production 
due to e*e^ scattering when the process is mediated by a Zo particle. 
How does it compare to the photon-mediated case at low energies? How 
does it compare at energies equal to the mass of the Zo? Take into 
account the finite lifetime of the Zo. 


4. For electron-positron scattering, as depicted in the diagram in fig. 22.8, 
show that 


FIGURE 22.8 
Diagrams for electron positron scattering in electroweak theory. 


[u(p5)*" (1 — ef y°) v(p1)] a, = 2m; [u(p5)" (e^) v(pi)] 


436 An Introduction to Particle Physics and the Standard Model 
and 
v(p2)" (1 — eq?) u(p1)a, 2m. [v(pz) (ey?) u(p1)] 


5. Draw all relevant Feynman diagrams for the following processes to lowest 
order. 


(a) y+y — uth (b) YAY Z+Z (0W*t-W- — Z+y 


6. Consider the process e+ + e7 — Z — f + f for both right-handed and 
left-handed electron wavefunctions. Compute 


OLOR 


OL OR 


for this process. 


23 


Electroweak Symmetry Breaking 


DOI: 10.1201/9781420083002-23 


The Higgs mechanism provides a means for the weak boson gauge fields of 
electroweak theory to acquire their mass. The gauge group is SU(2) x U(1), 
which means the equations of the theory have two kinds of gauge fields: the 
three W; wavefunctions, which obey a set of Yang-Mills equations, and the B, 
wavefunction, which obeys a Maxwell equation. This is kind of like a mixture 
of QCD and QED, except that (a) in addition to fermions we will also have 
the scalar Higgs wavefunction and (b) the symmetry will be spontaneously 
broken. 

Before considering how this mechanism works in the full electroweak theory, 
it will be helpful to consider how it works in the U(1) subsector of the theory. 
Let’s begin there. 


23.1 The Higgs Mechanism 


The idea that the weak bosons had to be very heavy was considered as long 
ago as 1961 when Glashow first suggested that the weak and electromagnetic 
interactions were unified [216]. The disparity in strength between the two 
interactions could be accounted for if the weak bosons had large masses. But 
nobody knew how to properly do this. 

The problem is not that we don’t know how to give a particle a mass — for 
example in the Klein-Gordon equation 


(9,0 +m?) ¢ =0 (23.1) 


the (mass)? term appears as a constant multiplying the wavefunction — indeed, 
we can regard this as a definition of mass. However, if we do this for a gauge 
boson, say the B,,, then we would modify eq. (12.27) to read 


0" (0, By — 0,B,) + MÈB, = jx (23.2) 
which would give the B,, a mass but destroy the U(1) gauge invariance of 


our theory (recall the discussion in section 12.4.1). This in turn renders the 
theory unrenormalizable, and therefore of no predictive power. So it appears 
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that if we want to give a mass to a gauge boson, we destroy renormalizability 
of the theory! 

The resolution to this dilemma was found by Higgs in 1963 [223], and then 
applied in 1967-8 to the weak interactions by Salam and Weinberg [217]. It 
involves introducing a scalar particle that couples in a particular way to the 
electroweak bosons and that has a non-trivial ground state. 

To see how this works, consider first a modification of the Klein-Gordon 
equation (23.1): 

0,0" 6 + V' (9) 2 0 (23.3) 


where we can regard V (¢) as the potential energy of the ¢ particle; V’ ($) = 
e is like a self-force that acts on $. For example, if V(¢) = $m?¢? 
then we obtain (0,,0" + m?) $ = 0, which is the Klein-Gordon equation for a 
massive particle. In this sense we can understand mass as being like a spring 
constant in a harmonic oscillator potential! 

More generally, we can make the potential anything we want it to be, but 
renormalizability demands* that it grow no faster than œt. Hence we can 
write the most general renormalizable potential as 


V (P) = Vo + Vid + Vad? + Va? + Vid" (23.4) 


The constant term Vo sets the zero of energy; we can remove it if we like, 

though we won't do so here. The cubic term can be removed by redefining ¢ 

by a constant shift: ¢ = $- 4. The linear term simply adds a constant to 

the modified Klein-Gordon equation, so for simplicity we'll set Vi = 0. 
Hence we take 


1 1 
V(¢) 2 Vo — 3 d a i^ (23.5) 

which yields 
ð p — u$ + A9? — 0 (23.6) 


as the general form of our modified Klein-Gordon equation. 

Although an exact solution to this equation (for arbitrary boundary condi- 
tions) would tell us everything we want to know about ¢, we don't know how 
obtain such a solution. Fortunately this is not necessary. We can consider 
perturbative solutions by writing $ (x) = do + ¢ (x), where do is the lowest- 
energy state (i.e. the ground state) of ¢, and $ is a wavefunction describing 
small excitations of ¢. Inserting this into eq. (23.6), to leading order in $ we 
have 

0,0" + (-u? + 32745) ó +A? — i^o ~ 0 (23.7) 

But what should we take for fp? If we want to expand about the ground 


state, we need to expand about the minimal value of V (4), i.e. the value 


* A proof of this is beyond the scope of this textbook; you'll have to take my word for it or 
else read a more advanced book on the subject [97]. 
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(x) = dy such that V'(óo) = 0. One solution is clearly dp = 0; however 
this gives 
0,0" — po ~ 0 (23.8) 


which is a Klein-Gordon equation with the wrong sign for the mass term. 
This is strange — the excitations have imaginary mass!, so the momenta of 
a $ particle would be larger than its energy, implying that relativistically it 
would move faster than light! However, upon closer inspection we see that 
do = 0 is really a local maximum of V (¢), so we perhaps should not be 
surprised that these excitations are unphysical. 

Another solution to V’ (Pp) = 0 is do = +4; this gives 


8,0" $ + 2u*ó ~ 0 (23.9) 
which is a Klein-Gordon equation for a particle of mass V2u! So excitations 
about $ (1) = +% are (to leading order) those of a free (and massive) Klein- 


Gordon particle! The full equation for $ is 


8,0" à + 2u?à + 3uA? + M? = 0 (23.10) 


and solving this equation (say perturbatively in A) will give us the wavefunc- 
tions corresponding to states excited above the ground state do = 5. Of 


course we could have chosen fp = —£ as the ground state of the system, in 
which case 
0,0" 9 + 2u?0 — 3uMP? + 93 = 0 (23.11) 


describes the excitations. 

Note that there is no new physics here — the equation for à represents 
exactly the same physical system as the equation for ¢. However, the $ 
version is the one better suited for describing excitations about the ground 
state of the system. 

The phenomenon described above is referred to as spontaneous symmetry 
breaking because it breaks the original symmetry of the theory. You can see 
that 


1 1 
V (9) = Vo — 3 d zb i^ (23.12) 
is invariant under $ <— —4@, but that 
vhent (E, sure rr tet exis 
HEC = 4 


is not invariant under à — —é. This happens because the ground state of 
the system does not respect the original symmetry of the theory. It is caused 


TThey are known as tachyons in the literature. 
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by the system tending to its lowest-energy state with nothing external to the 
system being responsible (hence the term “spontaneous” ). 

We can now make use of this phenomenon to give a mass to the gauge 
bosons - this is called the Higgs mechanism. In the Standard Model it works 
by coupling a scalar field to the vector bosons in such a way that the photon 
remains massless but the Z does not. 

Let's consider how it works for the B, boson. If we want to couple this 
to a scalar wavefunction then we must modify the y equation so that the 
derivatives in it are gauge-covariant derivatives 


Due = Oe + igy Bu (23.14) 


thereby modifying the Klein-Gordon equation to 


1 ov 
-D,D" =0 23.15 
where we take! 
1 2, 1 2 * 2 
V=Vo— se eet 1X (vv) (23.16) 


We also need the equation for the gauge field, which is 


1. . 7 
0" (0,B, — 0,B,) = 51d (>Dy — (Drp) v) (23.17) 


The quantity on the right-hand side is the current induced by the scalar 
wavefunction q. 

Note that we have taken the scalar wavefunction to be a complex linear 
combination of two wavefunctions, i.e. 


y= $1 + ids (23.18) 


This is necessary because we want our system to be locally U(1) gauge in- 
variant, i.e. invariant under 


p(z) > ey (z) (23.19) 
and the functional degree of freedom o (x) is sufficient to eliminate one of 
the wavefunctions $1 or da (hence the need for two wavefunctions, or a single 
complex c). It is not hard to show that the current induced by the scalar 


wavefunction is conserved: 


v 1 ; * * 
0 319v c Dye — (Dv) p) =0 (23.20) 
as it must be to ensure the U(1) gauge invariance. 


tNote that 


aos (p* gp) = p. The conjugate equation is i (Da Dtp)“ + $5 —0. 
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The y equation is therefore a pair of equations: 
0,0" 1 — i^ $i + A” (07 + 63) 91 — KBB" dr 
—9y0" B,,d2 — 2g, B,O" Q2 = 0 (23.21) 
0,0" b2 — u P2 + X (di + 03) $2 — 9; Bu B" Qo 
+90" B 91 + 2g; B,O" 1 = 0 (23.22) 


The minimum of Y (p) = 0 is now located at ($o1)^--($02)^ = vo = (2^. 
There are many ways of solving this equation, since the minimum lies on a 


circle — in y space — of radius £. We can expand about any ground state 
satisfying this condition, so we might as well write 
(im) - +ê) 60 (2) = ba (2) (23.23) 


in which case we get 

8,0" à; + 212 b1 — 3A? — nO? + X? (ài + 03) ài 
-92 B, Bb + § BB" — gy 0" B Q2 — 29. B 3" às = 0 (23.24) 

0,0" do — 21101 Q2 + X (ài + 03) da — 9 B, BY bs 
+9v0" Bài + 29 B,O" Q4 = 0 (23.25) 
Notice that the mass of Ai is v2p, but the mass of d» is zero. This isn't 
a coincidence — in fact it is a general phenomenon that always accompa- 
nies spontaneous symmetry breaking, as shown by Jeffrey Goldstone [224]. 
Spontaneous symmetry breaking of a continuous global symmetry is always 
accompanied by the presence of one or more massless scalar particles. This 
result is known as Goldstone's theorem and the the massless scalars are known 
as Goldstone bosons. In our example here, the continuous global symmetry 
(made local via the gauge principle) is the U(1) phase symmetry, and the 


Goldstone boson is the q. 
What happens to the gauge field equation? In components it is 


on (ð By = O, By) = Gy (620,91 — (10,62 — gv (oí + 03) B,) (23.26) 


which, when expanded about the ground state becomes 


ge (0, B, = O, By) = gv (9206; = $10,» — Gy G sp 3) By) 


X2 v A vYy2 T A PALA ( . ) 


or alternatively 


2 
Qo" (O, B, = Oy By) + oi) By = gy (930, = $10,092 — gv (ài t 33) B,) 


2 
+ a,b: + QO) p, (23.28) 
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We now see that a miracle has happened - the gauge field has acquired a mass 
MB = A 

We can actually eliminate the d» wavefunction by using the local gauge in- 
variance mentioned above; just choose a (x) = — tan! (02/61) in eq. (23.19). 
'The mechanism still works, and we are left with a massive gauge field with 
three degrees of freedom, and a single scalar particle ài. Note that this is the 
same number of degrees of freedom as in the original system: the massless 
gauge field had two degrees of freedom and we had two scalar particles. By 
promoting the global phase invariance to a local gauge invariance, the mass- 
less Goldstone boson is eliminated, replaced by the extra degree of freedom in 
the now-massive gauge boson. Note that the physical systems are the same 
both before and after expanding about the ground state — what has changed 
is the description of the physics relative to the ground state. 


23.2 Breaking the SU(2) Symmetry 


To implement the Higgs mechanism in Electroweak theory, the Higgs wave- 
function is taken to be a complex doublet under SU(2), which I will call ®. 
We take the covariant derivative to be 


9 9 
D, = 0,0 — i o WO + Py Bue (23.29) 


and repeat the above procedure, setting 


s- D, ho x Es 
~ V2NÓós t idu] AD 
and introducing a potential Y (9) = Vo — 12010 + 1? (9i). Note that 
the Higgs wavefunction has both hypercharge (Y = +1) and weak isospin 
of Z, as it must since we want it to mix the wè and B, wavefunctions after 
spontaneous symmetry breaking. The quantities c^ are none other than the 
Pauli matrices, which generate the SU(2) group (recall eq. (5.17)) 


pea” . abe I“ 
ES == 23. 
| 5155 | ie (23.30) 
where e*P* is the familiar fully antisymmetric Levi-Civita symbol. This means 


that the field strength for the W** is 
FS, = ô WE — O WE — gu WW. eg (23.31) 
which sometimes is written as 


>. E 


E, = ô W, — OW, — gu W, x W, (23.32) 
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taking advantage of the cross-product structure from the ef, tensor. It will 
also be useful to define 


(o + io°) (23.33) 


23.2.1 The Gauge Equations 


The equations of motion for the gauge particles of the electroweak theory are 


Ge ie 
OBL, gu We Eres = cxlo t) xr eoo pae a 


i (o 0*D,d — (D,à)! a*d) (23.34) 
8" (Ə B, — 3p B,) = ie (9! D,o - (D,&)! $) 


5 [VERE wxE + YE + VEX WE + Y EXE] (23.35) 
where we sum over the three lepton families £ = €, M, and €, and the three 
quark families Q = D, 6, and B. Note that for the U(1) wavefunction B, 
each fermion current in eq. (23.35) is multiplied by YY, which signifies its hy- 
percharge. As promised this differs from the QCD/QED case by the presence 
of the Higgs wavefunction 6. 

Before writing down the Klein-Gordon-type equations for ® and the Dirac- 
type equations for the fermions, let's look at this set of equations more closely. 
We see from (23.35) that the U(1) wavefunction B,, has a current coming from 
both the left-handed and right-handed fermions, as well as a current from the 
Higgs. The SU(2) wavefunctions W; also have a Higgs current, but only a 
left-handed fermion current as discussed above. There is also a current carried 
by the W/ themselves, which comes from the left-hand side of the equation 
and is due to the quadratic and cubic terms in the W’s. As with QCD, 
this last current will give rise to triple-W and quadruple-W vertices. Note 
also that the fermion current for the B, does not change the flavor of the 
fermions as they emit a B,, particle, but that the fermion current for the W*^ 
does change the flavor (e.g., turning a muon into a muon-neutrino) due to the 
presence of the 0” operator in Xow (c*) xi. 

'The symmetry of electroweak theory is broken because the minimum of the 
potential 


y($)-2W- m + ae (ia)? 


is not at ® = 0, but rather at 


oio — 92492-49242? (23.36) 
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where v = £. Under a local SU(2) transformation 9" = exp [ioe (x) z| $, 
the potential remains invariant and the equations of motion are covariant 
(provided the gauge and fermion wavefunctions are also correspondingly trans- 
formed). Since there are three arbitrary functions 0% (x) we can use these to 
eliminate three of the ¢’s, allowing us to write 


p " m m) (23.37) 


where h(x) = 0 now signifies the ground state, i.e. the minimum of the 
potential. Making this choice restricts us to a specific choice of gauge, but it 
is one where the physical degrees of freedom are explicit. 

To see how the gauge bosons become massive all we need to do is to insert 
this form for ® into the equations of electroweak theory. It is straightforward 
to show that when (23.37) holds then 


wi- iW? 
D = tows 5 ( AM. B, uy +0 (h) (23.38) 
gw 


and so the Higgs currents to this order are 
ig gw? g 
(èto “Dd — (D,D)'0*D) = (23) LL B, wa} (23.39) 


igy (xi Pe Wie ee I A 
; (è D,® — (D,) $) - (=) gw Gv (Es, +) (23.40) 


and we see that only the linear combination TB, +W; appears (and not Bj, 
and W? independently). Hence the gauge equations (23.34, 23.35) become 


OM Fi, — gwW Fb, et, = PRIM XE + XP (0) x? 

- (y W + O(h) (23.41) 
OY Fiy — gw WI" Fr cp = = XL mw (07) XE. + e XP (c^) xP 

= m W?2 +0 (h) (23.42) 


= Jw 
O F3, — gwW Fee, = SE (c D a iex 


e (= Bu wi) +O(h) (23.43) 


g - 2 
9" (Ə By — 3y By) = 5 [VEX DWNT +YRXRVWXR 
+YP XE WXP + YRXRWXR] 


vA? gv 3 
= (5) Iw Gy moe E Ep (h) (23.44) 
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where we see mass-type terms for the gauge fields emerging. It’s still not quite 
clear where the photon wavefunction is. Since the fermion current on the right- 
hand side of equation (23.44) is not the current (j™)" = Xy ge Qh yupi 
of electromagnetism, we know that the photon can’t be Bp. 


23.2.2 Gauge-Field Mixing 


What then is the relationship between the bosons W} and B, and the Z- 
boson and the photon A,,? And what is the relationship between W^ and 
the weak bosons Wo ? Let's write 


Wzz a (WiciWi) W3 = cos 0w Zp — sin 0w Ay 


By = sin 0w Z, + cos Ow Ay (23.45) 


and insert these expressions into the gauge equations (23.41-23.44). After 
some algebra these become 


0" (Ə WÈ — 0,W£) + (sey w+ 


2 
= 59w0* (WEW3 — WZW3) — g4W*" F3, 
Iw — + Iw _ 4 
+ (05) xi XP (o) xP +O(h) (23.40) 
V2 V2 
à 2 
o” (Op Zv —0, Lig) + (20) (E sin Aw + cos 0%) Lo 
2 Iw 
2 
+ (=) (= cos 6, — sin J (E sin Ôw + cos J A, 
2 Iw Iw 
= gw COS Ow (WHF = W-"F) 
+gw cos Oy (0^ (W7 Wt — W7 W7 
u v v Yu 
gy sin 0 = E 
5 (VERE WAR  YRXROVXR) 
gy sin O E Er 
A (YP XP WXZ + YE XRWXR) 
gw COS Oy , = 
SS xw (99) XE + XE Y (0?) XE) +0 (h) (23.47) 
à 2 
8" (8,A, —8,A,) + (LEY ( 9X. cos Oy —sin Ow) A, 
u m 
2 Iw 
2 
(23) (5 cos Oy, — sin 2 (= sin Oy + cos ax) Z, 
2 Iw Iw 
gy cos 0 i v 
= 2 (VEE xt + YRXRWXR) 
gy COS Ow 


3 (YE XE wxE + YE XR WR) 
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gw Sin Ow 


z (iow (03) xÈ + XE» (0?) xP) + O (h) (23.48) 


where I = P" (Fi, SE EL 

This is a pretty complicated set of equations, so to understand them we will 
have to look at them in pieces. First notice that there is a mass term in the 
last equation for A,. If we want to identify this with the photon wavefunction, 
we must eliminate this term, along with the Z, term because the photon does 
not couple to the Z,. This we can do by setting gy sin Os = gy cos Aw; this 
also eliminates the A, term in the equation for Z,. We also must require that 
the photon current is of the form gej; = Q8x$4, X8. For the leptons we 


have 


e$ o GwSinOw ,0 
ge Je") = LA (mw (0?) xÈ) 


2 
deem [YE xin + YEXEWNE 
= ; (gw sin Aw + gyY "* cos Ow) XT WT 
+ (—gw sin Oy + gyY*" cos Ow) XL Y XT 
ste Va (23.49) 


with an identical structure for the u and 7 leptons. Since neutrinos have 
Q" — 0 the first term on the right-hand side of eq. (23.49) must vanish, and 
we must also have Y "^ = Y *» since both states in the left-handed Ez doublet 
must have the same hypercharge. Finally, the coefficients of both x7 y,xT' 
and XX must equal —e, where e is the electron charge. Putting this 
together we obtain the solution 


1 
gw Sin Ow = gy COS yy = Je = € yu Ye gr” -1 (23.50) 


The hypercharge assignments for the y and 7 leptons are respectively the 
same, and so we obtain —ex*y,x* for the electric current for the leptons. 
For the quarks 


ge Gr")? = PERO (xa, (03) xB) 
Be 808 Ow Ty a + VERB WE 
= E [0 + Y+) nox E E Y^ rog] 
EY yy (23.51) 


2 
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with analogous expressions for the c, s and t,b quarks. Since Q" — 2 and 
Qd = -i, we obtain 


yu = yY% =Y = Yay aye 


(23.52) 


w| = 


4 2 
Yor yr eye 7 eS yR x (23.53) 


for the quark hypercharge assignments. 


23.2.8 Gauge Boson Masses 


We see from eqs. (23.46 — 23.48) that the W* and the Z equations now have 
mass terms 


Mes Más (23) JY sin Oy +cos0y | = => (23.54) 
Iw 2 cos Ow 


yielding the relationship 
Mw = Mz cos dw (23.55) 


between the W= and the Z masses. Since we can measure the mass of the 

W= bosons, and since we can infer the value of gw from the measurement of 

Fermi’s constant (recall eq. (20.27)), it’s not hard to show that v = 246 GeV. 
We can read off the fermion current for the W~ boson as 


wE Iw — 


= vai (o*) x$ 


the same for each quark and lepton doublet. Note that this current mixes 
together the up and down isospin components of a doublet. For example 


ote; = kon) (5) = (5) - ; [OM (23.56) 
es), e) m 


Iw —é +) ,¢€_ Jw Iw —é -\,¢€_ Jw- 
—= y LO = UL e —= y (0 = =EL YV 23.58 
am (ot) x$. va een p (o) xz van (23.58) 


and so 


'This is very much like the isospin symmetry we looked at for the strong inter- 
actions, in which the proton and neutron were placed in an isospin doublet. 
'The general form of a left-handed flavor wavefunction is 


yi = b (23.59) 
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where u refers to the isospin-up wavefunction of $ (ie. u = ve, Vu, Vr, U,C, t) 
and 0 refers to the isospin-down wavefunction of $ (ie. respectively 0 = 
€, 1, T, d',s', b). The full interaction term is then given by summing over all 
$, analogous to the sum over all charged particles in QED. 

Note that the right-handed parts x? have no isospin — they are singlets. I 
will use the notation ug and 0g where u and 0 refer to up-type and down-type 
wave functions. 

The current for the Z is a bit more complicated. If we write 


. NN 
jy = XL I 
then using eqs. (23.50) we get 
‘ gy sin 0 = X = = 
9250 == [Y EXE WXE + YEXRWXR + YE XD WXE TYR XR EXE] 
Jw COS Ora 
2 rra 
i sin 0 : 1 / 
= gw cos Pw Jy — zoz de [gejo" — g” sin Owja] 
= Ze j3) — sin? Ow (j*")" 23.60 
sin Ow cos Ow (G ) sin W (j ) ) ( y ) 
and so we identify 
Je 
= ——————— 23.61 
ue sin 0w cos Ow ( ) 
(j7)" = (j*)" — sin? Ow (j*7)" (23.62) 


From this expression for (j7)" we can pick out all of the weak neutral couplings 
to the Z-boson, which we wrote down before in table 22.2! We see that in 
general 

& = (8-20! sin? Ow) = ch = GG (23.63) 


For a fermion doublet x3 we have 
; 1. : = 
(¡5 = ¿10 — Q? sin? 6w (x8 x8) (23.64) 


and for the total neutral current we sum over $. 


23.3 Fermion Masses 


The equations for the fermions are Dirac-type equations with gauge fields. 
We could add in mass terms directly, but why not make use of the Higgs 
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wavefunction to generate masses for the fermions as well? This would provide 
a unified picture of the origin of mass, in which all particles attain their inertial 
masses through their interactions with the Higgs particle. 
How might we do this? First notice that the Dirac equation for a wave- 
function w 
(iy" (0, — ieA,) - m) v = 0 (23.65) 


can be separated into its left-handed and right-handed parts 


vy" (0, — ie A) vg = mur iy" (0, — ie A) YL = MYR (23.66) 


by multiplying eq. (23.65) by 3 (12:4?) respectively. So the way to in- 
troduce a mass in electroweak theory — where left-handed and right-handed 
particles are distinguished at the outset — is to insert a constant times wr, 
in the equation for Wr and the same constant times wr in the equation for 
wy. A mass-generating term must have the same effect when the symmetry 
is broken. 

For electroweak theory we have left-handed fermion doublets x$ but right- 
handed fermion singlets y?,. This means that in the equation for x5, we will 
need to multiply the doublet E by something that will change it into a singlet. 
The only thing that will do this is another doublet, which we fortunately have 
— the Higgs wavefunction! Similarly, the equation for xŠ is a 2-component 
doublet equation, and so we need a doublet muliplying xŠ to give us a doublet 
structure on the right-hand side. 

In terms of physical processes, we want our mass terms to have the effect 


that 
qu u u qu 
d XL XL a 

+ — and > + 23.67 
i (a) Qi) Gi) ii (o>) ee 
which balances isospin and hypercharge on both sides for both quarks and 
leptons, as you can easily check using equations (23.52) and (23.53). So in 
other words we put a term of the form diys on the right-hand side of the 


equation for x}. This term isn't constant, but if we expand about the ground 


state we have pix? => (ud + hi), and the vx? term will generate a 


V2 
mass. 

What about the equation for x5 ? Here we have what appears to be a 
problem: for the right-handed up-type quarks Y — 4, so if we just replace x% 
with x} in the above equations we find that Y = T on one side but Y = i on 
the other. The reason for this is that the Higgs doublet has Y — 4-1. 

What we need is a doublet with Y — —1. Fortunately this can be done 
without introducing any new particles! Using the familiar e-tensor (but now 


in two dimensions) we construct the charge conjugate ® of the Higgs 


"P M pu Dx 
$i = gt s ES 2 es (23.68) 
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which has Y = —1 and isospin 3, so that the processes 


ou u u qu 
xt (3) - (3) and INES (23.69) 


can be consistently mediated. 

As a last step we also uus terms multiplying x and x% on the right-hand 
side of the equation for x9 7; clearly these must be of the form 9x5, and byt R 
(each of which have hypercharge Y = 3) to balance isospin and hypercharge 
on both sides. 

The equations for the fermions in electroweak theory generate the Higgs- 
fermion-fermion vertices needed to ensure the above processes take place, and 
can be written as 

" (8, - ifta wat ¡Sy 8 p,) x5 = GSK + HP (23.70) 
iq” (8, + i Y^ B,) XR = que (ox?) 


“gy yo n] ORT (Bt HK (23.71) 
^ (8, ee RB) XR=H (o xi) 


where the quantities G3* and H3? are constants, known as Yukawa couplings, 
that parametrize how strongly the Higgs couples to the up-type fermions (via 
G ) and down-type fermions (via H ). Except for the hypercharge operator 
Y, repeated indices are summed over — so when a 0 is repeated in a given 
term it means sum over the down-type objects, and when a u is repeated it 
means sum over the up-type objects. 

'The most general thing would be to sum over all six values of 0 and u. But 
this would lead to processes like ur + 9" — cr, which violate lepton and 
baryon number conservation. So let's make G and H reducible matrices in the 


quark (q) and lepton (1) sectors 


ek. 0 HEE y 
SAR f: 
G -(% a ES) HÍ -( : HFK (23.72) 


where each of GFK, HFK, Gro. and HEK, are 3 x 3 matrices. 


The above structure is still pretty complicated — if we expand about the 
ground state we will get all kinds of mixings between the different flavors. 
However, things are not quite so messy as they might seem because we have 
a lot of freedom to redefine the flavors. For example in the quark sector we 
can define 


= ca, «t (23.73) 

ol = pisi oL = pili (23.74) 
for both the right-handed and left-handed wavefunctions, where for conve- 
nience I have defined 


{ae porar™ after} = (ul, u2 15) 
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and 


Leste: apniresee. pesto — i. o? D 


so that the generations are labeled 1,2,3 in order of increasing mass. Here 
£r isa 3 x3 unitary matrix that interchanges the left-handed-up-type flavors; 
RE is a 3 x 3 unitary matrix that interchanges the right-handed-down-type 
flavors, etc. 

To see what we can do with this freedom, let’s unpack the fermion equations 
(23.71) into up-type and down-type parts 


^ (o, - i$ Wh +e By) uh + Ten wot 


6 
=orGlkyK + ouykok (23.75) 
9v wu + ig” (0, Wi SET 
Ja? u ML + Vy ( +1 Rn u | L 
anaku +P HoE (23.76) 
int (a, +i “oy By) ul = Gist (ue = pu) (23.77) 
n Q E ie B, ) o; 9L = HIKt (auk 469 oÉ) (23.78) 


and then multiply by the £4 and Rq matrices so that we obtain equations in 
terms of the hatted wavefunctions 


^ (a, — ie Ww A3 HZB, + a "WE (cr ce) gk 
+ IK-k 4 3K 

= 9% (cMG Rt) ig +o" (ctt RH og (23.79) 
g x Koko. J J SI 
um Ni [OT ED up dy" (o, + iw; e if B) di 


= IK x IK ~ 
--4*(rMG RI) "uk +0 (cU BRI) eR (23.80) 


2gy 
H AN oe 
(a, 1 3 B ») ak 


Ik 3K 


- (nt Gic) aK — e» (mitate!) (23.81) 
iy" (a, — i= B) " 


aK y 00 (rem A SK. (23.89) 


— ou (RH HILT 

=o" (RHET) 

'This doesn't look like much of an improvement until we realize that any 
complex matrix M when med on the right and left by different unitary 
matrices U1, Uz becomes ut MU, = Mp, where Mp can be chosen to be real 
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and diagonal’. So let's choose 
IK 1 IK IK 1 {14K 
(LEGR) = ¿mid (Lg HRI) = cmi (23.83) 
to obtain 
l g g Hn 
iy" (a, — TW + Ba ya ur + T FWI (ct Lt) ok 
1 5 ~ a 
2 S9 mis +o" (CE Ry) KSK (23.84) 


IK ~ ES 
IN EW (cH eT) ak + iy" (a, quum z Wå + ie B,) ol 


/2 H q q 6 
= Ik ~ 1 2 
= 6" (pao Rt) ak + md (23.85) 
2 "EP. IK 4 
i3" (a. m Pp) ük = -O mba, — e" (RIIGILE) ^O (23.86) 


PS IK . 1 
iy" (a, - iSt B,) $$ =O (RUHL) ^ uf e" mpoL (23.87) 


where there is no sum over the index I in the above equations. Expanding 
about the ground state $" = 0, $? = (v +h) gives 


PX IK ~ 
int (8, — i$ wi + ie B, + yw; (este "9E 


6 ya" 

= mlül +0 (h) (23.88) 

: _ dw _ Gy SI , Iw e 

iy" (a, Ei Wi +i 6 B,) 07, 4 Ja) Wa porum 
= mol, + 0 (h) (23.89) 
DG. Via S 

int (4, HB «) al, = mul O (h) (23.90) 
O B ) ôk = mis, +0 (h) (23.91) 


and so we see that we have also used the Higgs mechanism to generate the 
masses of the quarks! 

I could write these equations in terms of the physical W+, Z and photon 
wavefunctions, but I won't do that here. It's a straightforward exercise since 
equations (23.88) — (23.91) are linear in each of these quantities. Instead, let/s 
turn our attention to the leptons. We can repeat the same exercise in this 


case, setting ul — vl and ol > A. However, there is a crucial difference from 


$ To show this, the polar decomposition theorem indicates that for any complex matrix M 
we can write M = HU, where H is Hermitian and U unitary. Choosing U; = SÍ and 
U> = UTS where S diagonalizes H gives the result. 
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the quark case because in the Standard Model the right-handed neutrino does 
not appear. So we might as well set VL = =G] IK — 0 in the above equations, 
in which case repeating these derivations gives 


y (a, d WI Pg y ol + iol yw CI: = O (h) 
aT Wi (tc KK Tid" (a, + iW + i B,) a = mie, 


n Q = £2) a = mlil + O (h) 


which we can rewrite as 


y (a, ae A Wo + EB, y pa ~ Wt’ = O(h) (23.92) 

Iw el ; . 9w .g p! I7 
o v Vr, + iy" (a, + PW, + iB) ly = mtn (23.93) 
p (a, = iF B BE il, = mia + O (h)(23.94) 


where I have defined im 
pl = (aran) pl (23.95) 


This means that the e, u, 7 leptons get mass, but not the neutrinos — and 
it is the vl that should be identified with the distinct neutrino flavors, i.e. 
vl = Ve, De =v, and v2 = Vr. 

There is a price to be paid for this — if we switch to hatted fermion wave- 
functions here, we must also do it in the equations for the gauge particles. 
This will only modify the currents 


(2) Log O° sin? y (iP) 


14K 1-K AK. 43 laK a 
= -Uz y ¡E — d; rok — sin? Ow | Zu aegis — 20 rK 
2 2 3 3 
I-K- 1-K ak 
+571 ok — "n HOS + sin? Oy (i y) (23.96) 
(¡1,59% = QF (xy) 
24K 12K ~ ak 
= 5H RE — 50 IEE nek (23.97) 
w+ Iw 8 +) 45 
= v O 
ie” = gut (ot) xi 
GX pap y KI oI, 9^ 
= “ty (cic UNS ME ie os 23.98 
/2 LY ( q a) V2 LO v*L ( ) 
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AK E 

| Iw Ut et KJ AJ Iw ~K 
= =20, CL VAT, + alr WV 23.99 
1/2 L ( q 1 WUL 4/2 LYVWYL ( ) 
which we see is not much of a modification at all! The only parts that look 
different are the charged quark currents in equations (23.98) and (23.99), 

which have the form 


AK = PX KJ x ase 
we aK where FE = (guien 9) = pK) 23.100 
L L L q ^q L L 


v2 


The matrix VK! is a unitary matrix, which superficially has nine independent 
components. However, since its only appearance is in the charged currents 
and in the couplings of the W= to the quarks, we can eliminate some of its 
components by redefining the phases of the mni and the 97. This is six phases 
in all, but one of these is an overall common phase that will cancel out in the 
charged current, leaving us with five phases that can eliminate five of the nine 
components of V. 


So the hatted wavefunctions are none other than the familiar quark and 
lepton wavefunctions that we have encountered in QED and QCD. We see from 
the manipulations in this section that while the Higgs mechanism provides an 
origin for the masses of all of these particles, it does not explain the values of 
these masses — they remain as arbitrary parameters. A large coupling of the 
Higgs particle to the top quark becomes the large top quark mass, and a small 
coupling of the Higgs particle to the electron becomes the small electron mass. 
'The mysterious large ratio of the top mass to the electron mass is replaced 
by an equally mysterious large ratio between the Higgs coupling to the top 
and the Higgs coupling to the electron. You might also have guessed by now 
that V is none other than the CKM matrix (21.54): 


Vua Vus Vub Cy 8103 8183 
V= | Vea Ves Væ | = | —s1c2 €1€263 — 8263€? c16253 + 52036? 
Via Vis Vib — 5152 €182€3 + C283€" C1 $283 — cacge” 
(23.101) 


whose components are also arbitrary parameters. We have no explanation for 
the values of these different parameters. At this point in time we can only 
measure them. 


However, there is one scalar particle left over whose mass is undetermined. 
This is the Higgs particle, the last particle in the Standard model yet to be 
discovered. Finding it will not only be a crucial test of Standard Model 
physics, but also provide essential information on the origin of mass. 
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23.4 Appendix: Feynman Rules for Electroweak Theory 


1. NOTATION. Label the incoming (outgoing) four-momenta as p1, p2, ..., Pn 
(pi, Pa, ..., Phn), the incoming (outgoing) spins as 51, 52,..., Sn (85, 55, ...,5],,); 
the incoming (outgoing) weak boson polarizations as &'/,&5,... (£1 ,&5,...) 
and label the internal four-momenta q1,q2,...,qj. Assign arrows to the lines 
as in figure 23.1, in which time flows from bottom to top, dashed lines are 


j] ' 
PS, 


qi» 9 43> 


Pis 


D, Sn 
P252 


P3,53,8 


FIGURE 23.1 
A typical diagram in electroweak theory. The dotted line in the outgoing state 
is a Higgs particle. 


weak bosons (with a,b = +,—,0 labeling the W*,W-,Z? bosons respec- 
tively), Higgs particles are dotted lines, and lines with arrows are fermions 
(if they point upward with time) or antifermions (or if they point downward 
against time). 


2. EXTERNAL LINES. Each external line contributes a factor as illustrated 
in figure 23.2, and all incoming/outgoing Higgs particles have a factor of unity. 


456 An Introduction to Particle Physics and the Standard Model 


E 
incoming "à =e"(p) 


u 
weak " 


bosons | outgoing  .^ u(y 
/ =E 
" (p') 


: : x 
incoming ~=u(p,s) an | incoming “Z5(p,s) 


fermions : 
outgoing ^ fermions outgoing 4 
=u (p',s') = v( 


ps) 


FIGURE 23.2 
External lines in electroweak theory. 


As in QED, factors associated with external lines correspond to the incom- 
ing/outgoing plane-wave states, an assumption that is clearly not valid for 
the unstable weak bosons and Higgs particle (as well as for all but the lightest 
fermions), but which we will take to be valid on the very short timescales in 
collisions. 


3. INTERNAL LINES. Each internal line contributes a factor as shown in 
figure 23.3, where m is the mass of the quark or lepton, Ma is the mass of 


fermion/ i(y:q* m) = 


antifermion 2 2 
q -m q 
q,dy ab 
l| g e ô 
» M a b 
weak a a reer arene ee 
boson = 


FIGURE 23.3 
Internal lines in electroweak theory. 


the weak boson and I, is its decay width. As before, a,b = +,—,0 label 
the W*,W-,Z? bosons respectively, and the ó** in the propagator simply 
ensures that the identity of the weak boson is maintained in every internal 
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line. Once again q? 4 m? because the particle flowing through the line is 
virtual (i.e. it does not obey its equations of motion). These internal lines 
are called propagators. 


The next two rules are the same as for QED, QCD and ABB theory. 


4. CONSERVATION OF ENERGY AND MOMENTUM For each vertex, 
write a delta function of the form 


(IES (ky + ko + ka +--+ kw) 


where the k’s are the four-momenta coming into the vertex (i.e. each k” will 
be either a q^ or a p"). If the momentum leads outward, then k^ is minus the 
four-momentum of that line). This factor imposes conservation of energy and 
momentum at each vertex (and hence for the diagram as a whole) because 
the delta function vanishes unless the sum of the incoming momenta equals 
the sum of the outgoing momenta. 


5. INTEGRATE OVER INTERNAL MOMENTA For each internal momentum 
q, write a factor 
d*q 
(27)* 


and integrate. 


6. VERTEX FACTORS There are many different vertex factors in elec- 
troweak theory because the symmetry breaking yields a lot of distinct inter- 
actions between the various particles. They can be listed in any order, of 
course. I have chosen to list them proceeding from the experimentally most- 
studied interactions to the least-studied. 

First there are the fermion-W vertices in figure 23.4 which differ between 
quarks and leptons. Next are the fermion-Z vertices in figure 23.5 whose 
couplings depend on the identity of the fermion in the process, as indicated. 


f f 
f Cy CA 
Ve, Vy, V 1 5 
es" pn»"T 2 2 
€ ,H,T -i-2sin?0y =3 
u, c, t 4 — $ sin? Ow m 


1 25:522 
d, s,b 23 + 3sin Ow -5 
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w- 
V W- a 
ee HB uf ys 
__ Bw u(1— 5 B sat e 
: sat (1-y?) 0, 

£ 0, =d,s orb 
u, —u,c ort 
V,, is the CKM matrix 

FIGURE 23.4 


Charged weak interaction vertices. 


FIGURE 23.5 
Neutral weak interaction vertices. 


Since the weak bosons themselves have weak charge, they couple to each 
other, and since the W bosons carry electric charge they also couple to the 
photon. This gives the vertices for 3-boson couplings shown in in figure 23.6 
and for 4-boson couplings in in figures 23.7 and 23.8. 

Next we have the vertices describing the interaction of the Higgs particle 
with all of the other particles, shown in figure 23.9. 
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u 

ka Soy 

’ = ie(e, (kı k), + 824 (lo La), + a (s - 4), ) 
yo ki ky 


FIGURE 23.6 
Triple gauge-boson vertices in electroweak theory. 


H K 
Ze Fa 7 l 
w = —i (8w cos8, ) (28,8: — 2.8. — 8,81«) 
Wwe MW 
m rá Ew A 
u K 
RN / i 
w rd Y = ige (2810810 — Bue — 82:80) 
wt Nw 
v "d ht A 


FIGURE 23.7 
Quadruple gauge-boson vertices in electroweak theory. 
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K 
u 
Y Y 
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FIGURE 23.8 


Quadruple gauge-boson vertices in electroweak theory with at least one pho- 
ton. 
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Finally we have the vertices describing the self-interaction of the Higgs in 
figure 23.10. 


FIGURE 23.10 
Higgs self-interaction vertices in electroweak theory. 


The remaining rules are the same as for QED and QCD. 


7. TOPOLOGY To get all contributions for a given process, draw diagrams 
by joining up all external points to all internal vertex points in all possible 
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arrangments that are topologically inequivalent. The number of ways a given 
diagram can be drawn is the topological weight of the diagram. The result is 
equal to —iM. 


8. ANTISYMMETRIZATION Because fermion wavefunctions anticommute, 
we must include a minus sign between diagrams that differ 
(a) Only in the interchange of two incoming (or outgoing) fermions/anti- 
fermions of the same kind 
Or 
(b) Only in the interchange of an incoming fermion with an outgoing 
antifermion of the same kind (or vice versa). 


9. LOOPS Every fermion loop gets a factor of (—1). 


10. CANCEL THE DELTA FUNCTION The result will include a factor 


(27)t8® (p, + p5----- pi, — Pi — 92 —::: — Pn) 


corresponding to overall energy-momentum conservation. Cancel this factor, 
and what remains is —i M. 
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23.5 Questions 
1. Show that v — 246 GeV. 


2. (a) Show that the potential 


1 1 
V(®) = V - 5^9! + 2 (918) 


A EA 2 2 
is minimized at $!$ = tl =Y- 


(b) Show that a general SU(2) transformation ®’ = exp [ioe (x) z ® 


can be used to write 
jon 0 
ma a +h(x) 


where h (x) is a real scalar function. (Hint: use the Euler form of the 
SU(2) transformation). 


3. Show that the current 


1, y 
59v (P Due — (Due) v) 
is conserved for the scalar Higgs field in the Abelian case. 


4. Show that if 


then 


and 


tow (tb oa tota) = (2Y 1 we! 
; (90*D, 0 (0,8) 0%0) = (E) q WWE, aap + O(n) 


=z (neta) (D) omar 


5. (a) How many independent real parameters are there in a unitary N x N 
matrix? How does this answer change if the matrix is orthogonal? 


(b) Consider an extension of the Standard Model with N generations of 
quarks. Since phases of quark wavefunctions are arbitrary, how many 
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independent parameters are there in the CKM matrix of this model? 
For how many generations can the CKM matrix be made real? 


(c) Show for N — 3 that 
Jiajh = Im (Via Vio Vis Via) (no sum over repeated indices) 
provides a measure of CP-violation regardless of the choice of indices. 
6. Show that the Higgs conjugate wavefunction $ 
Qi = gig 
transforms as an SU(2) doublet, but has opposite hypercharge Y — —1. 


7. Write equations (23.88) — (23.91) and (23.92) — (23.94) in terms of the 
physical W+, Z and photon wavefunctions. 


8. Consider a different model for electroweak interactions, one in which the 
Higgs wavefunction is an SU(2) triplet Y of real scalars (analogous to 
the way that the W's are a triplet) instead of an SU(2) doublet of 
complex scalars. 


(a) Write down the potential Y (V) that is SU(2) invariant and renor- 
malizable. 


(b) Find the minimum of this potential. 


(c) Write down the gauge equations for the W bosons coupled to the 
Higgs wavefunction. Set all fermion wavefunctions to zero. How many 
W bosons can gain a mass this way? 

(d) Why isn't this Higgs triplet used to break the symmetry of the 
electroweak model? 


24 
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'The electroweak theory has been scrutinized in great detail for more than 25 
years, with experimentalists seeking to check as many of its predictions as 
possible. Today we have great confidence in this theory, which, along with 
QCD, forms the Standard Model. In this chapter we will consider some of the 
main predictions of the theory and the experimental tests that confirmed it. 


24.1 Discovery of the W and Z Bosons 


'The observation of the weak bosons at CERN in 1983 was a triumph for the 
electroweak theory. The search for these particles began in 1976 when Rub- 
bia, Cline, and McIntyre proposed that the CERN Super-Proton-Synchrotron 
(SPS) be converted into a storage ring in which protons and antiprotons would 
counter-accelerated up to high speeds and then collide [225]. Energies of 270 
GeV per beam appeared feasible, yielding a CM energy large enough to pro- 
duce the W and Z particles, if they indeed existed. Two experiments — UA1 
and UA2 — were set up at the redesigned SPS collider to search for the weak 
bosons. 

'The beam of antiprotons was obtained via the stochastic cooling technique 
[72] developed by Simon van der Meer as discussed in chapter 7. The proton 
and antiproton can be thought of as two groups of quarks and antiquarks, 
each carrying roughly 1/6 of the total momentum. Since the expected masses 
of the W and Z particles were somewhere between 70 and 100 GeV, a CM 
energy of six times this was needed. The initial design was for 540 GeV, to 
be upgraded to 640 GeV later on. 

The calculation of the cross-sections begins at the quark level, then takes 
into account the gluons and the quark distribution functions. At these energies 
the sea quark contribution to the cross-sections is negligbly small, and so the 
annihilating quark is in the proton and the annihilating antiquark is in the 
antiproton. The calculations indicated that 


o (pp W 464 >€ Ve +--+) = 530 pb 
o (pp — Z + — eet) œ~ 35 pb (24.1) 
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at the design phase. These are very tiny cross-sections — about nine orders of 
magnitude smaller than the total pp cross-section of 60 mb. This means that 
the detector needs a discriminating power of about 10!°. Although hadronic 
decays of the W and Z are more frequent, they are very difficult to distinguish 
from the background, and so only leptonic channels were considered. 

The actual decay channels used in the search are listed in table 24.1 The 


TABLE 24.1 
Decay Channels Used in the Search for Weak Bosons 


W= — e®ve isolated electron at high pr and high missing pr 
W= —= u*v, isolated muon at high pr and high missing pr 
Z — e et two isolated electrons at high pr of opposite sign 
Z— wut two isolated muons at high pr of opposite sign 


leptons emerging from the decays must be isolated, i.e. not appearing inside 
a jet. Otherwise, it is not possible to be sure that they came from a W or 
Z instead of the decay of some other quark state. The crucial quantity of 
experimental interest is the transverse momentum pr, which is the component 
of momentum perpendicular to the colliding beams. If this quantity is small 
then we cannot be sure that the lepton in question was produced by W or 
Z and not some other particle in the beam. Consequently the only events 
retained in the large collection of data are those with isolated leptons with 
large transverse momentum. Of course if a neutrino is produced then it cannot 
be detected directly. However, if there is a large amount of missing pr in 
conjunction with an isolated lepton of high pr, then it is virtually certain that 
a neutrino was emitted, signalling the presence of a W boson. 

Any given event will have large numbers of tracks that come from the vari- 
ous particles produced that are different from the W or Z. After eliminating 
(or “cutting” ) all data with low pr, the remaining events having some track(s) 
with large pr are analyzed by neglecting all tracks with pr smaller than 1 
GeV. To measure the mass of the W the “Jacobian peak” method is used. 
When the W is produced its momentum is almost entirely in the direction of 
the beam; it will have only a very small component of transverse momentum. 
The electron and neutrino will each have large transverse momentum, mean- 
ing that the angle between the direction of the W and that of the electron is 
large. If we Lorentz transform to the rest frame of the W, these transverse 
components remain the same (see fig. 24.1), and will be related to the angle 
0* relative to the original direction of motion via 


M 
pr = |p| sin 0* ~ Ee sin0* = > sin 0” (24.2) 


where we can neglect the mass of the electron at these high energies. The 
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FIGURE 24.1 
Decay of the W* in the lab frame and in its rest frame. Note that the 
transverse momentum pr of the positron is the same in both frames. 


transverse momentum distribution, Mem is given by 


dn dn dé dn 2 u 1 dn (24.3) 


dpr  dó*dpr — d0* Mw cos 0* [cy — pp, d 
2 T 


dn 
d0* 


the W. The crucial point is that is that the Jacobian ro diverges when 

_ Mw 
Pr= A 
divergence becomes a sharp peak in the pr distribution at Mw. By counting 
the number of events as a function of pr and locating this maximum, the mass 
of the W can be determined. The UAI experiment found My = 83+3 GeV 
[226], and UA2 found My = 80 x: 1.5 GeV [227]. 

'The mass of the Z boson can be measured by more conventional means, 
since the energies of the emitted lepton and antilepton can each be measured 
by the calorimeter and the angles (054,0,-) between the beam tracks can be 
determined. If the Z exists, it will show up as a peak in the invariant mass 
m (LL), where 


where 


is the angular distribution of the electrons in the rest frame of 


Since the transverse momentum of the W is not exactly zero this 


m (SE = 2m? + 2Ep+ E; — 2pp+pp- cos (05 + 0,-) (24.4) 


and the location of the maximum determines its mass. UAI found Mz = 
93 + 3 GeV [226], and UA2 found My = 91.5+1.7 GeV [227]. Today we 
know these values with considerably greater precision [1] 


Mw = 80.398 + 0.0259 GeV Mz = 91.1876 + 0.0021 GeV (24.5) 
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and the electroweak model with its predictions for the W and Z have been 


spectacularly confirmed. 


Another important prediction of electroweak theory is the lifetimes — or 
decay widths — of the W and Z. Let's begin first with the W. The decay 
diagram for W^ — (7 + 7% is given in fig. 24.2 and is straightforward to 


V, E 


Pi P> 


FIGURE 24.2 
Decay diagram for W^ — (^ + v, 


calculate since it is a two body decay. We have 


Maes (5) y (24.6) 


and neglecting the masses of the leptons in the final state decay rate in the 
rest frame of the W is 


3 
WA Pe — 1 ( dw ) Mw Gr (Mwe?) 
T(W £r = MI = E 
( ? De) Sh (Mo | 3 1242 2mh 62x, 
(24.7) 


which is the same for each type of lepton! The results for the quarks are given 
by the same diagram, except for two things. First, the W is color-neutral 
and so when it decays into a quark-antiquark doublet the colors of the decay 
products must be equal and opposite; since our detectors are color-blind we 
must add over the three colors. Second, the couplings will be multiplied by 
the appropriate CKM matrix elements. 

The results for the full set of decay widths is given in table 24.2. You can see 
that there are no entries for decays of the W into a top quark because of the 
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TABLE 24.2 

Decay Widths for the W Boson 
Channel AD Value (MeV) 
e^ 4 P, LUE 225 
uy, SIM 225 
ru SEO 225 
u+d Sen Vaal? 640 
its Ses Val 35 
64s ey var 660 
Z+b ees Vasl? 1.20 


heavier mass of the latter, a fact not known at the time the W was discovered. 
Furthermore, the decays into b quarks are very rare due to suppression factors 
from the CKM matrix. The total width, obtained by summing over all of the 
partial widths, gives rw =T (W7 — anything) = 2.04 GeV 

The decay of the Z is computed in a similar manner. The decay diagram, 
fig. 24.3, is just like the one for the W except now the couplings are given by 
table 22.2. This means the matrix element is 


M = (=) (££) fea (e - ch) vto) (24.8) 
yielding 
rz 187) = ae qat = E (2) SEE Y le) 
" eun (ch)? + (44)") (24.9) 


where the relations Mz cos0yy = My and gzcosOw = gw have been used 
and I have included the color factor Ne, which is 3 for quarks and 1 for leptons. 

The results for the full set of decay widths of the Z is given in table 24.3. In 
this case all of the allowed decay channels (remember, top decay is forbidden 
because the top is too heavy) make an appreciable contribution to the width. 
'The decay into neutrinos cannot be directly observed, but the total decay rate 
into three neutrino flavors is $^, l'(Z — ve + Ve) = 495 MeV. If there are any 
other unknown particles that couple to the Z but not to the photon or gluons 
— an example would be an additional flavor of neutrino - they will contribute 
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FIGURE 24.3 


Decay of the Z boson into a fermion-antifermion pair 


to the width of the Z if their masses are less than Mz. Measurement of this 
“invisible” decay channel width provides an important check on the number of 
low-mass neutrinos. Likewise, it is not feasible experimentally to distinguish 
the different quark-antiquark channels (though sometimes clever methods can 
pick out the cc or bb ones), but we can obtain the total hadronic decay width, 
which is Pz = T (Z > hadrons) = 2P (Z > u +u) + 3T (Z ^ d + d) = 1.67 
GeV. The total width is given by summing over the partial widths and is 
T (Z — anything) = 2.42 GeV. 

Of course these calculations neglect corrections due to loop diagrams, and 
can be done much more precisely. Today we know the empirical values of the 
decay widths to very good precision [1] 


Tw = 2.141 + 0.041 GeV Pz = 2.4952 + 0.0023 GeV (24.10) 


as well as distinct channels 


T(Z — invisible) = 499.0 + 1.5 MeV 
T(Z => l4-1—) = 83.984 + 0.086 MeV (24.11) 
T(Z — hadrons) = 1744.4 + 2.0 MeV 


and all are in excellent agreement with the Standard Model! An important 
constraint is that there cannot be any more than three light (i.e. less than 


half the mass of the Z) neutrino species, placing an important constraint on 
fundamental particle physics models. 
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TABLE 24.3 
Decay Widths for the Z Boson 
Channel hr Value (MeV) 
c? 3 
Vt + Ve L 165 
Gr(Mzc . E 
£- A a ) (5 2 sin? Ow + 4sin* Ow) 83 
u+u Gr(Mzc?)? 4.2 m 
a 272 (27 38in” Ow + y sint Ow) 280 
d+d $ 
= Gr (Mzc?) ..2 2.4 
H. ak a HH Ow + $ sin Ow) 370 


24.2 Lepton Universality and Running Coupling 


The kinematic criteria that allowed for unambiguous identification of W pro- 
duction and subsequent decay into ev. also applies for subsequent decay into 
pv, and Tvr. Cross-sections for each process can be separately calculated and 
measured, making a test of lepton universality possible. The results indicated 


[1] 


I — 1.00 £0.07(stat) + 0.04(syst) IT — 1.00+0.10(stat) + 0.06(syst) 
= zi (24.12) 
showing that the €z, Mz, and ry, doublets all couple in the same way and 
with the same strength to the W boson. 

This is in accord with the Standard Model prediction, which also states 
that there is absolute conservation of three separate lepton numbers: electron 
number, muon number, and tau number. These conservation laws continue to 
be tested in a variety of ways. For example, the best limits on the conversion 
of one charged-lepton type to another come from u — ey and y — 3e, for 
which [1] 


P(u > ey) 11 P(u > 3e) -12 
12x 1 10x1 
T(u — all possible) E 52 T(u — all possible) eee 
(24.13) 
For the 7 lepton the corresponding limits are not as stringent 
P(T > uy) -8 P(r > ey) 2 
6.8 x 10 1.1x 10 
T(r — all possible) » úl T(t — all possible) S n 


(24.14) 
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though there is still no evidence for any violation of charged lepton num- 
ber. Semileptonic decays and decays of charged leptons into other charged 
antileptons yield similar limits, again in accord with the Standard Model. 

What about the neutrinos? We do have firm evidence that the different 
flavors of neutrinos can change into each other. Pll postpone discussion of 
this subject until the next chapter. 

Another early test of electroweak theory was in the value and evolution 
of the weak mixing angle 0w. You might recall from Chapter 23 that this 
parameter has a clear relationship to the electroweak coupling constants and 
the electromagnetic coupling 


x Je 
Ow = = — E 24.15 
ei 92 = Sin Ow cos Ow ( ) 
and so we have another prediction from the Standard Model that can be 
experimentally checked. It is difficult to directly measure the weak couplings, 
but fortunately we don’t have to. We can instead make use of the relation 


2 
Gr = Y ( n ) , which gives 


8 \Mwe 
29? 292 1 
Mwe = V29% _ vag 72 Ta SES GeV 
8Gr 8sin?0y Gp sinOdw\ y/2Gp  sindw 


(24.16) 
yielding a relationship between the measureable W boson mass, the fine struc- 
ture constant, Fermi's constant, and sinOw. Similarly the relationship be- 
tween the W and Z boson masses 


sin? Ow = 1 — ZW (24.17) 


provides an important cross-check on the validity of electroweak theory, since 
each quantity can be measured independently, though in the case of sin? 6y 
not directly. A variety of low-energy processes provide such checks, illustrated 
in figure 24.4. 


1. Parity violation in atoms. Since electrons can exchange both photons 
and Z bosons with the quarks in the nucleus, there will be a parity- 
violating shift in the energy levels of atoms due to the different helicity 
couplings the electron has to the Z. The effect is only a few parts per 
million, but can be measured and provides a test of electroweak theory 
at the KeV scale [228]. 


2. Forward-backward asymmetry in e* e^ annihilation. There is an inter- 
ference between the virtual photon and Z diagrams, a process we looked 
at in Chapter 22. This asymmetry has been measured in a wide energy 
range [231] between 10 and 200 GeV, and is large for energies near the 
mass of the Z as we have seen. 
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3. Scattering of v, and Y, off of electrons. This process is very clean due 
to the absence of hadronic interactions and provides the most direct 
means of determining the weak mixing angle. However, the cross-section 
is very low — four orders of magnitude smaller than neutrino-nucleus 
scattering. Nevertheless the CHARM2 experiment first provided the 
most precise measurements of this process, yielding the experimental 
result [229] 

sin? Oy = 0.2324 + .0083 (24.18) 


results superseded by yet more accurate measurements [1]. 

Corrections from loop diagrams to the masses of the W and Z bosons modify 
the relationship they have with sin? 0yy. The most important diagrams are 
when the particle in the loop is either a Higgs boson or a top/bottom quark 
pair. The correction from the Higgs loop depends logarithmically on the mass 
of the Higgs boson, but the correction from the quark loop is proportional to 


Gr (m? — mp) ~ Gem; (24.19) 


which is quadratic in the mass of the top quark. A precise measurement of 
the mass of the W thus allows for a prediction of the mass of the top as long 
as the mass of the Higgs is not too large. 

Experiments in the 1990s at LEP measured with increasing accuracy the 
masses of the W and Z and sin? Ow by a variety of methods. In 1993 a bound 
of the top mass was obtained 


m, = 166 + 27 GeV (24.20) 


provided 60 GeV< my < 700 GeV. The mass of the top was measured less 
than two years later (as we saw in Chapter 18) and found to be within this 
bound. 

The coupling constants gw and gy in the SU(2) x U(1) Model both undergo 
renormalization, and hence *run" - that is change with energy (or distance) 
— the way that the electromagnetic and strong couplings do. Since 


IY — tan bw (24.21) 


Iw 


the quantity sin? @y will also be a function of the momentum transfer between 
two particles that scatter due to weak interactions [230]. The dependence 
is more complicated than that for the electromagnetic and strong couplings 
because both numerator and denominator vary. The change is very small, 
only a few percent over more than six orders of magnitude. The predicted 
behavior and experimental values are given in figure 24.3. Fermion loops and 
boson loops contribute to the renormalization but with opposite sign. At low 
energies the fermion loops are important due to the large mass of the W, but 
at high energies loops due to W and Z self-couplings set in. However, these 
contribute to gw (since it is a non-Abelian theory) but not to gy, changing 
the slope of the curve near the mass of the W. 
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FIGURE 24.4 

Running of the weak mixing angle as a function of energy, showing comparison 
with different experimental tests. The acronyms are APV: Atomic Parity 
Violation; Apy: Asymmetry in Polarized Moller scattering; Z: Measurements 
taken when CM energy is the mass of the Z boson (the *Z-pole" measurements 
[232]); v-DIS: Deep inelastic scattering of neutrinos from isoscalar targets 
[233]; AFB: Forward-backward asymmetry at LEP. The width of the curve 
reflects theoretical uncertainty from strong interaction effects [230]. Image 
courtesy of the Particle Data Group [1]. 


24.3 The Search for the Higgs 


'The only particle in the Standard Model that has not been observed is the 
Higgs particle. It obeys the equation 


3 3 
D,D"$ — p?® + 7 (99) 6 = G[XIa Xj + XigCux$ 
JL 
I=1 1,J=1 
3 
+ Y) Xin, (24.22) 
1,J=1 


where the covariant derivative is 
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FIGURE 24.5 
One-loop correction to the W and Z propagators (and hence their masses) 
due to the Higgs boson. 


.g .g 
D,9 = 0,0 — iro" Wo + ¿5 Bub (24.23) 
0 : DUC 
and we remember that ® = (, 4h a — the actual Higgs particle is de- 


scribed by the wavefunction h(x). If we insert this into the equation for ® 
we find after some algebra that its lower component gives 


0h + 21h + 3Auh? + A? n? 


o) wwe ( de -Jo th) 2,2" 


2 cos 0 
3 3 3 
= $ Gin xpo 22 Mind + 22 Min Ma, (24.24) 
I=1 1,J=1 1,J=1 


which shows that the mass my of the Higgs particle is 
mp = V2p (24.25) 


The Standard Model makes no prediction as to what its mp is — the param- 
eter u must be input into the model. This is arbitrariness is not particularly 
attractive, and theorists have wanted to do better. One idea for predicting 
the Higgs mass from the Standard Model was to set jj, = 0 at the outset 
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and then compute the effective potential V;g(h) for the Higgs boson due to 
its interactions with all of the other particles in the Standard Model [234]. 
You might think that the only minimum of the effective potential would be 
at h = 0 since we set u = 0, but quantum loop diagrams generate an effec- 
tive potential V.g(h) that has a new minimum, from which a nonzero Higgs 
mass can be computed, determined from the other parameters in the Standard 
Model. This approach is clearly more attractive, since the arbitrary constant 
u no longer appears. For a number of years it was thought to be viable until 
the discovery of the top quark with its enormous mass (and therefore large 
coupling to the Higgs) appeared to rule out this mechanism [235]. However, 
recent considerations that systematically sum over higher order corrections 
[236] have shown that perhaps this mechanism is viable after all, and that it 
predicts a Higgs mass of mp = 220 — 225 GeV. 


It is possible to obtain bounds on mq indirectly from the Standard Model 
using precision measurements of electroweak observables. As I noted above, 
the masses of both the W and Z bosons get l-loop correction terms (see 
figure 24.5) that depend on the logarithm of the Higgs boson mass. Since 
their masses are known so precisely, we can place a bound on the Higgs mass, 
although not a very stringent one due to the logarithmic dependence. A global 
fit to precision electroweak data, accumulated at LEP, SLC, the Tevatron and 
elsewhere implies that mg < 182 GeV at 95% confidence level [1]. This is 
somewhat less than the 220 GeV prediction above, but not so much less as to 
rule out the effective potential mechanism. 


FIGURE 24.6 
Gluon fusion produces a Higgs boson. 


It's clear we will have to wait for LHC to do its job. At present, the best 
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lower bound on my is from LEP [237]: 
mp > 114.4 GeV at 95%c.l. (24.26) 


which involved colliding electron-positron pairs to produce Z bosons at en- 
ergies larger than Mz. The virtual Z can emit a Higgs boson, which will 
preferentially decay to bb quark pairs due to the heavy mass of the bottom 
quark. The limit that can be reached on the Higgs mass is the difference 
between the center-of-mass collision energy Ecm and Mz. LEP was able 
to attain Ecm of 209 GeV by installing additional superconductive radiofre- 
quency cavities in the ring, providing additional power compensate for the 
large loss of energy in synchrotron radiation. This resulted in the upper limit 
above. 
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FIGURE 24.7 
Other modes of Higgs production: Associate production via W/Z 
brehmsstrahlung (left) and WW or ZZ fusion (right). 


It’s possible to do a bit better. Precision electroweak measurements at 
LEP indirectly constrain mp to be lower than 182 GeV at the 95% C.L. This 


is done by computing how the Higgs particle modifies the ratio An due to 
Z 
loop diagrams that arise from quantum field theory. The ratioM™ has been 


M 
measured to high precision, and yields the above upper limit once the lower 


bound of 114.4 GeV is taken into account. The most recent experimental 
result for the Higgs mass is from Fermilab [238]. By combining all known 
data from proton-antiproton collisions at Eem = 1.96 TeV, researchers have 
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excluded a Higgs boson with mass mp = 170 GeV at 95% c.l., adirect 
restriction, albeit within a very narrow energy range. 


At the LHC the dominant mechanism for producing the Higgs boson is 
via gluon fusion, g + g — h, as in figure 24.6. Since the top is the heaviest 
quark, the Higgs couples most strongly to it, and so this quark loop dominates 
all other possible loops. It is also possible to produce the Higgs via fusion 
of W or Z bosons or via a brehmsstrahlung (perhaps it should be called 
“Higesstrahlung”) process as shown in figure 24.7, but these contribute less 
strongly. 
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FIGURE 24.8 


Branching Ratios for Higgs decay as a function of Higgs Mass. The solid ver- 
tical line is the LEP bound of 114 GeV. Reprinted figure from A. Djouadi, J. 
Kalinowski, and M. Spira, HDECAY: a program for Higgs boson decays in the 
Standard Model and its supersymmetric extension Comput. Phys. Commun. 
108, 56 (1998) [239]; Copyright (1998), with permission from Elsevier. 
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How the Higgs particle decays will depend on its mass. If it is less than 
half the W mass (mj € 140 GeV ) then it will decay into bb pairs. Above this 
mass up to 160 GeV it will predominantly decay into W pairs, and then above 
that into Z pairs, which fully dominate if mh > 180 GeV. Only if the Higgs is 
very heavy — mp S 360 GeV — will it decay into top pairs, and even then this 
is not the dominant process. There are other possibilities once loop diagrams 
are taken into account: the time-reversal of the diagram in figure 24.6 can 
yield decay into gluon pairs (h — g + g) and also the less likely decay into 
photon pairs, which is easily seen by replacing the gluon lines with photons 
in figure 24.6. 

A diagram of the all possible branching ratios for the decay of the Higgs 
boson as a function of its mass appears in figure 24.8. 

The main purpose of the Large Hadron Collider is to carry out a direct 
search for the Higgs boson. If it is found it will put in place the last piece of 
the puzzle in the Standard Model. If it is not found, then we will have firm 
evidence of physics beyond the Standard Model — and undoubtedly another 
revolution in particle physics. 


24.4 Questions 


1. Consider the Higgs mechanism in the Abelian case. Write down the 
equations for the Higgs wavefunction and the gauge wavefunction if the 
Higgs field is written as p= v + h (x). 


2. Suppose we used the SU(2) symmetry to write the Higgs field as 


a= [om 


What would the physical theory be in this case? 
3. Show that the equation for the Higgs wavefunction 
3 3 
D,D'9 — ya c X (She) P= P» GIXTr XL i p» XigCuxj, 
I=1 I,J=1 
3 
+ Y Mirta 
1,J=1 


becomes 


0h + 27h + 3Auh? + Ah? 
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(gw)? —117+u Iw g u 


3 3 3 
v£ £ v vu e 
= COR xij, YS xiu, 
[=1 1,J=1 1,J=1 


0 


when ® = (ene 


| Hint: remember that the gauge currents are 


conserved. 


4. The Higgs potential is 
$)—W l 2619 1 dip)? 
V (8) 0—5H 1^ ( ) 


(a) Suppose Vo = 0. How much energy per unit volume does the Higgs 
potential contribute to the vacuum energy density of the universe if we 
assume A= dc? 


(b) The actual energy density of empty space is smaller than 1074 
GeV/cm?. What value would Vo have to be set to in order to ensure 
that your answer in (a) does not exceed this value? 


5. Compute the decay rate 7 
Viet qoe p 


to leading order in the Standard Model, where f is a lepton or a quark. 
6. Compute the decay rates 


(a) h = W* +W" 
(b)h=>2Z+2Z 


to leading order in the Standard Model. 
(c) Plot the ratio 
T(h—W*-4-W-) 
T(h >Z+zZ ) 


as a function of h. 


7. Someone comes to you and proposes to find the Higgs using an ete” 
machine by examining the process 


ete —Mh—oW*-4W- 


so that the existence of the Higgs particle can be inferred by finding 
deviations relative to other intermediate states (like the photon or Z) 
in producing the WtW- pair. Is this a good idea? 
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Beyond the Standard Model 
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The Standard Model - the comprehensive theory that describes our current 
understanding of the electromagnetic, strong, and weak interactions is not 
expected to be the final theory of physics. There are three main reasons 
underlying this expectation. 


1. It has too many adjustable parameters. There are six different 
quark masses, three different lepton masses (since neutrino masses are 
assumed to be zero), one Higgs mass, four mixing angles from the CKM 
matrix, one mixing angle associated with ground state of QCD (as we 
shall see), and three coupling constants (one for each force). This is 
a total of 18 arbitrary parameters that are not predicted by the Stan- 
dard Model itself, which is mathematically valid and theoretically self- 
consistent for any numerical values of these parameters*. If one allows 
for the possibility of neutrino masses then an additional seven parame- 
ters must be added to this list — three for the masses of the neutrinos 
and four associated with the mixing angles for a CKM-type matrix in 
the lepton sector — bringing the total to 25. If one includes gravitation, 
then two additional parameters — the gravitational coupling constant G 
and the cosmological vaccum energy (parametrized by a cosmological 
constant A ) — must also be included bringing the total number of unex- 
plained adjustable parameters to 27. It is generally believed (or at least 
preferred) that a more complete theory [240] will have fewer adjustable 
parameters — ideally only one! 


2. Gravity is not included. The problem with including gravity is not 
simply a matter of adding Einstein's equations of general relativity with 
its two additional constants (G and A) to the model. If we neglect quan- 
tum mechanics, writing down the standard model in a self-consistent 
way that includes Einstein's theory of general relativity is well-known 
and understood. However, no one knows how to include gravity in a 


*T should qualify this statement, since there are mathematical issues concerning the exis- 
tence of quantum field theories that are not fully resolved. What I mean by this statement 
is that all of the mathematical manipulations performed to extract physical predictions 
from the Standard Model would be just as valid if other values for these 18 parameters had 
been used besides the ones measured in experiments. 
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quantum framework [241]. There are three major difficulties that must 
be overcome. 


e General Relativity is not a renormalizable theory. Not only are 
quantum corrections to gravitational processes infinite (as they are 
for gauge theories), but the number of these infinite corrections 
is also infinite (unlike the situation in gauge theories). The renor- 
malization procedure works in the Standard Model because we only 
ever need to redefine a finite number of parameters (such as mass 
and charge) to eliminate these infinities. However, in gravity, every 
possible physical quantity would need to be redefined to eliminate 
the infinite corrections. Doing this renders the quantum theory 
useless for predicting anything because there is always another ad- 
justable infinite parameter that could be renormalized to agree with 
experiment. 


e Time is treated differently in gravity. In quantum theories time 
plays the role of an ordering parameter that tells us how a quan- 
tum system evolves from the past into the future. However, in gen- 
eral relativity you can change your coordinate definitions of time 
and space pretty much any way you want, with only a few restric- 
tions due to mathematical self-consistency. This means that two 
different observers in a gravitational field can have very different 
definitions of how time advances, and it is not clear which defini- 
tion(s) is(are) the right ones to use in incorporating the quantum 
theory. 

e Causality becomes uncertain. The light-cone structure of special 
relativity ensures a rigid separation of past and future for any 
observer at any place and time. The Standard Model - indeed 
all quantum gauge theories — rely upon this structure for making 
predictions. For example the scattering theory we developed in 
chapter 9 depends upon a clear definition of past quantum states 
that are emitted by sources and future quantum states that are 
absorbed by detectors. However, in general relativity gravity dif- 
ferentially slows down time, making the light-cone structure depend 
on the location of the observer. Quantum gravitational corrections 
are therefore expected to introduce some quantum-mechanical un- 
certainty into the structure of the light-cone (and our notions of 
causality) at each point in space and time. This leads to a strange 
conundrum — we won’t know how to compute this kind of light cone 
without first having a quantum theory of gravity, but we won’t be 
able to construct a quantum theory of gravity without first under- 
standing this kind of light cone! 


Theorists worldwide have been trying to make progress on these prob- 
lems for more than 40 years, and many ideas have been proposed. Yet 
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there has been no clear path forward, and a quantum theory of gravity 
appears as elusive as ever. Uniting gravity and quantum theory is a very 
deep problem, regarded by many as the most challenging and significant 
problem in physics today. 


3. The universe is sharply biophilic. This means that our universe is 
not a typical specimen out of all possible universes one might consider, 
but instead has rather particular features that permit life. Of course it is 
clear that we could not live in a universe that did not permit life to exist. 
However, what has been slowly discovered over the past 50 years is that 
even rather minor adjustments of the parameters Standard Model and 
the structure of our cosmos would result in a universe that could not 
support carbon-based life as we know it. For example, if the mass of 
the up-quark differed by less than a percent from its experimental value, 
then neutrons would not decay into protons as observed. Either they 
would decay too quickly (if the up quark were a bit lighter), leaving 
pretty much only protons in the early universe, or they would decay too 
slowly (if the up quark were a bit heavier), making too much helium in 
the early universe, with little or no hydrogen to fuel the fusion processes 
in stars. In either case one of the conditions necessary for life as we 
know it would no longer hold true. Such arguments can be constructed 
for many different Standard Model parameters, including the masses 
of the light particles, the coupling constants of all of the forces, and 
the strength of the cosmological vacuum. It is rather peculiar that the 
existence of life is so sensitively correlated with the actual values of these 
parameters of the Standard Model. The reason for this situation goes 
beyond the explanatory power of the Standard Model [242]. 


There has been much effort expended in the past 15 years to find what 
physics lies beyond the Standard Model. In this chapter I will discuss some 
of the progress that has been made so far, and what the prospects are for 
further discovery. 


25.1 Neutrino Oscillation 


The best experimental evidence we have for new physics beyond the Standard 
Model is in the neutrino sector. A solid body of evidence has accumulated 
through a number of experiments indicating that the different flavors of neu- 
trino can change into each other, a phenomenon called neutrino oscillation or 
neutrino mixing. 

The idea of neutrino mixing goes back to 1957, when Bruno Pontecorvo sug- 
gested [243] that the neutrino-antineutrino system could oscillate in a manner 
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analogous to the K°K° system. At that time only one neutrino flavor was 
known. When the v,, was discovered ten years later he pointed out that flavor 
oscillations between v, and v, were not ruled out by any known experiments 
[244]. In the meantime two different Japanese groups had also proposed that 
neutrino flavors could mix. 

'The favored explanation for this phenomenon is that neutrinos are massive. 
If this is so then there will also be a 3 x 3 CKM-type mixing matrix in the 
lepton sector of the Standard Model, in which case we have physics beyond 
what the Standard Model predicts. As with the quarks, this will imply that 
the neutrino weak-eigenstates (which we call ve, V, and v.) are not the same as 
the neutrino mass-eigenstates. This means that an electron neutrino produced 
at a given point will, after propagating outward, have some probability for 
changing into one of the other two neutrino types. 

To illustrate how neutrino oscillation works, I'll begin with a 2-flavor model. 
As we will see later, this will turn out to be a good approximation for atmo- 
spheric neutrinos. Since the physical eigenstates must have definite masses, 
we have the relationship 


y" = du) =V Gao) - 322 (25.1) 
(vs) lv.) 

between the mass eigenstates and the flavor eigenstates, where V isa2x2 

matrix. Note that I have labeled the mass eigenstate wavefunctions with the 

numbers 2 and 3, using the convention of labeling in order of (anticipated) 

decreasing electron-neutrino content. Since probabilities are conserved, we 
have 

mtr —(wF) ur svivel (25.2) 


and so V is unitary. We can actually make it orthogonal by redefining the 
phases of the neutrino wavefunctions, as we did for the down-type quarks. 
Hence the most general form of V is 


e Gey d = e >) (25.3) 


—sin 093 COS 053 —8$23 C23 


where I have used a notation that anticipates a full 3 x 3 CKM-type matrix 
in the neutrino sector, and for convenience I have written cos 055 = C23, etc. 

Working non-relativistically, since the W”” wavefunction contains states of 
definite mass-energy, we have 


which has the solution 


|ui (£)) = e7 |v; (0)) = V" (t) = exp [-iHt/fi] V^ (0) (25.5) 
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and so the flavor eigenstates have the time-dependence 
V (t) = V-!w" (t) 2 (V^! exp [-iHt/h] V) Y” (0) (25.6) 
Multiplying the matrices out (and dropping the /is) gives 


vu (t))\ _ [c+ e PS ss 023523 (e Pt — e 159) N / |v (0)) 
D. (t)) ) B ow (es = great) cia, pg PRA ) ce (0)) ) 
(25.7) 
Atmospheric neutrinos (that come from pion decays in cosmic rays) and 
laboratory neutrinos (from accelerator experiments) are essentially pure vj, 
sources. This means that we should set v,(0) = 0 as one of our initial 
conditions. After a time t some v,’s will appear, with amplitude 


|v- (t)) = €23523 (ee =, CR [Vu (0)) (25.8) 
The probability of flavor conversion is therefore proportional to 


—i —i 2 
Ku, (t) lu, (0) ^ = e352 [e77 — =F] pu, (0)? 


l. 
= sin? (2923) (2 — 2 cos ((E» — E3) t)) lv (0)? (25.9) 
and so we obtain (reinstating the appropriate factor of A) 


P (v, — v7, t) = sin? (2053) sin? (22) (25.10) 
where the latter result is actually the relative probability, since plane-wave 
states can't be normalized. Note that the only wave to conserve lepton flavor 
is if the mixing angle 923 = 0 — any deviation away from this value indicates 
physics beyond the Standard Model. 

Now I will make an approximation that goes beyond what I have assumed 
in using the non-relativistic Schroedinger equation (25.4), but which is valid 
for high-energy neutrinos, namely that rest-mass energy of the neutrinos is 
small compared to their total energy. For high-energy neutrinos all of the 
same momenta we therefore have 


2 2 
A da Ms 
(Es — Ez) = yo? +m? - yP + mà »(1 E (1+ 74) 


Am? Am? 
E ETA = E (25.11) 


where Am? = m3 — m3 and E is the average energy of either neutrino. The 


path-length traveled in a time t is approximately L/c (since they are nearly 
moving at the speed of light), so 


Ame)’ L 
P (v, — v,,t) = sin? (2093) sin? (ec (25.12) 
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for neutrinos of energy E moving through a distance L. ^ The probability 
that the »,, survives is then 


Ame)’ L 
P (1, — Vp, t) = 1 — sin? (2053) sin? Ru) (25.13) 


Laboratory experiments are best suited to measuring P (v, — Vu, t). A 
device capable of detecting v, is placed a distance L away from a source 
of vy, of known flux and energy E, and the data are used to constrain the 


value of 253 and (Ame?y. The expected flux and energy at the detector are 
calculated, and the survival probablity is the ratio between the measured flux 
at the detector and the calculated survival flux. 

In practice there will be a spread of energies in the flux of neutrinos emerg- 
ing from the source. This will modify the simple trigonometric dependence 
above and therefore change the survival flux. It's not too hard to anticipate 
how. Suppose we have muon neutrinos at two similar (but not the same) en- 
ergies emerging from the source. For a short period of time there will not be 
much difference in the survival probability for each energy. However, as time 
increases the low-energy v,, will more rapidly change flavor. Since the survival 


2 

probability is minimized when E — ud the lower-energy neutrino flux 
reaches its smallest value before the higher-energy one does. After this the 
low-energy Y, flux will increase while the high-energy v, flux continues to 
decrease. The two fluxes soon become out of phase, with high-energy flux 
maximized when the low-energy flux is minimized and vice versa, with the av- 
erage flux oscillating about a mean value dependent on sin? (2054). Averaging 
over a spread of energies, the v, flux will decrease rapidly from unity and after 
several oscillations settle down to a mean value about which it oscillates with 
small amplitude. 


25.2 Neutrino Experiments 


Neutrino oscillation is of much current interest in particle physics, more so 
than ever since experimental evidence at the Sudbury Neutrino Observatory 
(SNO) has definitively confirmed the phenomenon. There are three main 
sources of empirical evidence for this effect. 


25.2.1 Solar Neutrinos 


Standard nuclear astrophysics indicates that our sun should be emitting a 
flux of neutrinos that is three times larger than has been observed. Much 
effort has gone into trying to find out whether the discrepancy is due to some 
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flaw in our understanding of the solar interior, or whether we need to modify 
our theory of neutrino physics [245]. The results from the SNO experiment 
have definitively adjudicated on this issue [42], indicating that the different 
neutrino flavors do indeed change into one another. 

Our sun burns hydrogen into helium via a thermonuclear process, in which 
the average thermal energy is a few tens of KeV. This is small compared to the 
Coulomb barriers of the interacting nuclei. However, the core density of the 
sun (10? kg/m?) is large enough that the reaction cross-sections converting 
hydrogen into helium can proceed. 


p+tp>D+e +v, 


| 


D+ p—>`He+y 
. Y 
14%| Her He>'Be+y “He+'He> p+p+*He | 86% 
Y Y 
99.9% | "Be+e — Li+y+v, "Betp—>*B+y 0.01% 
Y y 
Li+ p>*He+*He KB + p—*He+*Het+e* +v, 


FIGURE 25.1 
The pp-cycle of the Standard Solar Model. 


About 95% of the energy produced by the sun comes via the conversion 
of four protons into a Helium nucleus, through a chain of reactions shown 
in fig. 25.1. Neutrinos of varying energy are produced as a by-product. 
These stream through the sun with essentially no absorption’. A “Standard 
Solar Model” (SSM) has been developed over the past 60 years that reliably 
describes the details of this chain of reactions and the neutrino flux produced. 

Most of the v, flux produced by the sun is due to the conversion of two 
protons into deuterium. Detecting this is an experimental challenge because 
all the neutrinos produced are of low energy (less than 420 KeV). The highest 
energy neutrinos — up to 14 MeV — are produced in the Boron conversion to 


+The photons produced in these same reactions follow a different path. They interact 
with the rest of the material in the sun, producing new photons of lower energy that in 
turn further react with this material. This sequence continues until eventually photons are 
produced at the surface of the sun that stream off into space, carrying energy that was 
produced from the original reaction several thousand years earlier. 
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FIGURE 25.2 

The energy spectrum of solar neutrinos. Image reprinted from J. Bahcall, 
A.M. Serenelli, and S. Basu Ap. J. 621, L85 (2005) [?]. Reproduced by 
permission of the American Astronomical Society. 


two Helium nuclei. This is a very rare process since it depends on a rare 
production channel from the Be + p interaction. The more common Be 
interaction produces a Lithium nucleus along with a photon and a neutrino, 
which emerges at either 400 or 860 KeV. These predictions of the SSM are 
illustrated in fig. 25.2. 
Pontecorvo proposed that solar electron neutrinos could be detected via the 
reaction 
ULP Cl > e7 +° Ar (25.14) 


in 1946, long before neutrinos were actually detected [246]. John Bahcall 
and collaborators computed the reaction rate for this process in 1962, using 
the solar model that he and his collaborators were beginning to construct 
[247]. While the inital reaction rate appeared to be too low for detection, 
a superallowed transition at 5 MeV was discovered afterward, raising the 
reaction rate by a factor of 20 to experimentally feasible levels. 

Even so, the rate was still very tiny, and to observe it would mean using 
a huge volume of Chlorine in a region that was shielded as much as possible 
from all radioactive sources and from cosmic rays. An experiment was set up 
in the Homestake mine in South Dakota, USA, beginning in 1964 led by Ray 
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Davis, using 615 tonnes of C2Cly as the target “detector” [248]. The expected 
rate of Argon production is about 1 atom per day. This would be extracted 
every few weeks using a stream of helium gas and then put into a counter to 
detect the argon decays. The energy threshold for the experiment was 814 
KeV, which meant that only the higher-energy Be and B neutrinos could be 
detected. 

The experiment ran for 24 years (1970-94) and yielded the following result 
after 108 runs [249] 


2.56 +0.16 +0.16 SNU (exp't) 
8.1413 SNU (SSM) 


T (v, +97 Cl 5 e- +97 Ar) = { (25.15) 


expressed in Solar Neutrino Units (SNUs), which correspond to 1 capture 
per 10% atoms per second. There is a clear discrepancy by a factor of 3 as 
compared to the SSM. 

Of course discordant results were coming in from this experiment much 
earlier than 1994, and physicists began to explore other ways to measure the 
solar neutrino flux $. One attempt was the Kamiokande experiment, which 
used a water Cerenkov detector in the Kamioka underground observatory in 
Japan to measure the neutrino flux due to the elastic-scattering reaction 


vete —vpkte (25.16) 


which is sensitive to all neutrino flavors. However, this reaction can only 
proceed via Z-exchange for f = u,T, whereas it proceeds via both W and 
Z-exchange for electron neutrinos. Since the latter reaction has a cross- 
section six times that of the former, it is possible to check if the flux of 
solar neutrinos is purely v, — a diminished cross-section is indicative of a v, 
and/or v, flux component. The measured flux [250] — confirmed by super- 
Kamiokande (super-K), its successor — was found to be half the expected 
value. In particular 


(25.17) 


(2.35 + 0.02 + 0.08) x 10% /em?/s (exp't) 
(5.69 + 0.91 ) x 10% /cm? /s (SSM) 


Super-K was able to measure the direction of motion of the final state electron, 
and one could infer from this that the flux of neutrinos was definitely from 
the sun (and not some other source) [251]. 

Two other experiments were developed to measure the low-energy neutrino 
flux due to the pp reactions: SAGE in Russia [252] and GALLEX (later 
upgraded to GNO) in Italy [253]. These experiments respectively used 60 and 
30 tonnes of gallium, which was used to measure the rate of the reaction 


Ve +"! Cos e7 pre (25.18) 
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at a threshold energy of 233 KeV. SAGE is still running, but in 2003 the 
GALLEX/GNO experiment ended. The results were 


69.344.1+3.6 SNU (GALLEX/GNO) 
T (ve +" Ga e" +7 Ge) = 70.8+5.3+3.7 SNU (SAGE) 
126 + 10 SNU (SSM) 


(25.19) 
again indicating a clear discrepancy between theory and experiment. 

Could the SSM be wrong? This question troubled theorists and experi- 
mentalists alike for many years until 1995, when the GALLEX experiments 
reached sufficient precision to almost certainly rule this option out. By then 
the super-K experiments had measured the flux due to the Boron neutrinos 
and the flux inferred from solar luminosity due to the dominant pp contribu- 
tion was well known. The sum of these was greater than the flux GALLEX 
had measured. But since the GALLEX experiment is sensitive to not only 
these two sources of neutrino flux but also to the Beryllium neutrinos, there is 
no reasonable way to modify the SSM because the Boron nucleus is a product 
of the Beryllium reaction. 

Something was causing a large fraction of electron neutrinos to vanish on 
their way from the sun to the earth. The Sudbury Neutrino Observatory 
(SNO) was designed to see if neutrino physics was the cause. This experi- 
ment uses 1000 tons of heavy water (D20) located 2000 meters deep in the 
Creighton mine near Sudbury, Ontario. It began in 1999 and continued until 
2002. 

SNO measured the flux of solar neutrinos from the decay of 5B via the 
charged current (CC) reaction on deuterium 


v-HDopctpce- (25.20) 


mediated by W bosons and sensitive exclusively to ve’s, as well as the neutral 
current (NC) reaction 


vp -Dpctnav, (25.21) 


mediated by the Z boson and sensitive to all flavors. SNO also measured the 
elastic scattering of all neutrino flavors vy with electrons [254]. 

Since the CC rate directly gives the ve flux and the NC rate gives the sum 
of all three fluxes (with all being equal according to the Standard Model), it 
is possible to infer the flux 9, that is due to the other two flavors. In 2002 
SNO confirmed that this flux was significant, as illustrated in fig. 25.3. The 
latest results from SNO in 2008 yielded [255] 


BNC — 5,54 + 0.33 + 0.34 x 10%/cm?/s (25.22) 
CC = 1,67 + 0.05 + 0.08 x 10% /cm? /s (25.23) 
BES = 1.77 + 0.24 + 0.10 x 10%/cm?/s (25.24) 
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for the flux rates for all three processes. SNO has also precisely measured 
the total flux of Boron neutrinos from the sun independently from previous 
methods and finds agreement with SSM calculations, further indicating that 
new physics in the neutrino sector is responsible for the solar neutrino deficit. 
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FIGURE 25.3 

Fluxes of Boron neutrinos deduced from SNO data [42]. The bands represent 
uncertainties of 1 standard deviation. Image reprinted from Q.R. Ahmad et 
al. Phys. Rev. Lett. 89, 011301 (2002); used with permission from the 
Sudbury Neutrino Observatory; copyright (2002) American Physical Society. 


25.2.2 Atmospheric Neutrinos 


Cosmic rays contain a large number of pions which (as we saw in Chapter 
21) preferentially decay into muons and their associated (anti)neutrinos. The 
muons in turn decay into electrons (or positrons for the antimuon) and their 
associated antineutrinos (neutrinos). Hence one expects to see a flux of v, 
that is twice that of ve. However, the observed ratio of fluxes is only 60% of 
the expected amount. 
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Atmospheric neutrino oscillations were first discovered at Super-K in 1998 
[256]. This experiment consists of 22,500 tons (fiducial mass) of water in the 
Kamioka underground observatory. The water is used as a Cerenkov detector, 
which is sensitive to the processes 


Vu +N >w +N’ vo -HN—e +N’ (25.25) 


where N is the nucleus of either hydrogen or oxygen in the water. The elec- 
trons and muons each leave a single Cherenkov ring on the detector. These 
rings provide important information: the ring produced by a muon is ex- 
pected to have a sharp shape, whereas the electron shape is fuzzy due to an 
electromagnetic shower resulting from interactions the electron has with the 
water in the detector. Fully contained muons and electrons are identified by a 
pattern recognition algorithm exploiting the maximum likelihood method to 
compare the measured distribution of hits with the expected shapes. Some- 
times a particle is not fully contained but exits the central volume into the 
outer detector — in this case it is identified as a muon since only muons have 
such great penetrating power. 

The neutrino energies range from a few hundred MeV to several GeV. The 
cross-sections are strongly forward-peaked at these energies, and so the direc- 
tion of the outgoing electron or muon (which can determined from the direc- 
tion of the ring ) is strongly correlated with the direction of the neutrino. The 
amount of Cherenkov light is a measure of the energy of the lepton, which in 
turn is correlated with the energy of the neutrino. 

This information is enough to determine the distance the neutrino has trav- 
eled from its production point in the atmosphere. The flight length can vary 
from about 10 km (if they are produced in the atmosphere above Japan, where 
the zenith angle Y = 0) to 12,000 km (if they are produced on the other side 
of the earth, where Y = 7). 

Events are divided into low-energy (less than 1 GeV) and high-energy (up 
to several GeV) types, and are also divided into e-type and p-type events. 
Both "fully contained" (FC) events — when all of the visible energy is con- 
tained within the inner detector and *partially contained" (PC) events — when 
particles deposit visible energy in the outer detector were considered. The ve 
data show no indication of oscillation whereas the low-energy v,, data show 
a flux deficit that increases with increasing zenith angle [257], which is the 
same as increasing distance from the detector. The high-energy v, have no 
deficit for small zenith angles but beyond a certain zenith angle the flux be- 
comes about half of its expected value. These results are illustrated in fig. 
25.4. Experiments at the Gran Sasso lab in Italy and SOUDAN2 in the 
USA confirmed the v, flux deficit seen at super-K [258]. Since only the v, 
flux is affected, this is almost certainly indicative of v,,/v, oscillations. In the 
context of the 2-flavor model described above, the value of the distance can 
be used to determine Am? and the value of about > in the flux reduction 

T 


suggests that v23 c 7. 
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Zenith angle distributions for: (a) FC electron-like events, (b) FC muon-like 
and PC events, (c) FC muon-like events and (d) PC events. Down-going is 
when the cosine equals 1. The histograms with the shaded error bars show 
the Monte Carlo predictions with their statistical uncertainties [257]. Image 
reprinted from Y. Fukuda et al. Study of the atmospheric neutrino flux in 
the multi-GeV energy range Phys. Lett. B436, 33 (1998) fig. 5; Copyright 
(1998), with permission from Elsevier. 


25.2.3 Laboratory Neutrinos 


Another way to search for neutrino oscillations is by producing a source of 
neutrinos and then see if they disappear somewhere further along their tra- 
jectory away from the source. Disappearance experiments for v,, were carried 
out in both Japan at the KEK laboratory in Tsukuba and at Fermilab in the 
USA. In each case two detectors were used, one near the source to measure 
the initial flux and energy, and the other far away to measure the surviving 
flux. In Japan the far detector was super-K at 250 km away. In the USA 
MINOS was the far detector at 735 km from the source. These experiments 
also found a v, flux deficit consistent with super-K. 
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The Kamioka Liquid scintillator Anti-Neutrino Detector (KamLAND) ex- 
periment provided important confirmation for the SNO experiments. This 
experiment used several Japanese nuclear power plants as a source for elec- 
tron antineutrinos. Each source had a different power and was at a different 
distance (typically about 180 km) from the KamLAND detector. This detec- 
tor, an ulta-pure scintillation detector, could accurately detect and measure 
the energy of recoil electrons from elastic scattering with the antineutrinos. 
This information can be used to infer the electron antineutrino flux and energy 
spectrum, which was found to be 0.658 + 044(stat) +047 (syst) of the value 
expected if no electron antineutrinos had disappeared. Plotting the ratio of 
the observed spectrum to the no-oscillation spectrum as a function of 1/E, 
KamLAND found that there was flux oscillation consistent with the sinusoidal 
L/ E dependence that is characteristic of neutrino oscillation [259]. 

In 1995 the Liquid Scintillator Neutrino Detector (LSND) group at Los 
Alamos presented evidence that muon antineutrinos could change into elec- 
tron antineutrinos [260]. The LSND detector consists of an approximately 
cylindrical tank 8.3 m long by 5.7 m in diameter, whose center is 30 m from 
the neutrino source. The tank is filled with 167 metric tons of liquid scintilla- 
tor consisting of mineral oil and 0.031 g/l of b-PBD, which affords detection 
of both Cerenkov light and scintillation light. The Cerenkov cone and the 
time distribution of the light give excellent particle identification. Cosmic ray 
muons going through the detector are tagged by a veto shield surrounding it. 

'The experiment searches for the reaction 


ptbpe—->ettn (25.26) 


followed by n+ p— > D+ 4(2.2 MeV). The 2.2 MeV photon y is determined 
to be correlated with a positron (or not) using a likelihood ratio that depends 
on the number of hit phototubes for the photon, the reconstructed distance 
between the positron and the photon, and the relative time between their 
production. The experiment found a total excess of 51.0 + 20.2 + 8.0 events, 
which (if due to neutrino oscillations) corresponds to an oscillation probability 
of (0.31 + 0.12 + 0.05)%. 

This experiment, while intriguing, has yet to be confirmed. The related 
KArlsruhe Rutherford Medium Energy Neutrino (KARMEN) experiment sear- 
ched for such an effect but saw no oscillation [261]. However,, this experiment 
differed from LSND in that the neutrinos travel a distance of 18m at KAR- 
MEN but 30m at LSND. The KARMEN results do not cover the full region 
of parameters that LSND does, and so it is possible that both experiments 
are correct. 

While atmospheric experiments all suggest that v,,’s predominantly decay 
into v,'s there is as yet no direct test of this expectation. The CNGS ex- 
periment (for CERN Neutrinos to Gran Sasso) is a proposal to make such 
a test [262]. In this experiment a v,, beam from the CERN SPS machine is 
aimed through the earth's crust to the OPERA detector at the Gran Sasso 
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laboratory 737 km away. This detector is sensitive to the reaction 
vp+N—>r +N (25.27) 


which is achieved as follows. The neutrino energy must be above 10 GeV to 
ensure that the above reaction is above threshold. However, this means the 
flight length is small compared to the oscillation length, and so only a small 
number of v,’s are expected, which in turn means a small number of 7^ 
events. This means the OPERA detector must have a large mass (to maximize 
cross section) and a high resolution (to distinguish the decay products of the 
77). Using a combination of emulsion and electronic techniques, OPERA 
will achieve micrometer scale resolution with a mass of 2000 tons. 


25.3 Neutrino Masses and Mixing Angles 


From a particle physics viewpoint, there are two possible explanations for 
neutrino oscillations. The most favored is that neutrinos have mass. A flux of 
electron neutrinos produced inside the sun could, if sufficiently massive, oscil- 
late into muon neutrinos which would go undetected in the experiments which 
have been carried out to date. In 1985 Mikayhev and Smirnov [263], building 
off work done by Wolfenstein [264], showed that this neutrino oscillation pro- 
cess could be greatly enhanced inside the core of the sun because of the high 
density of nucleons there. This effect remains the favored mechanism for ex- 
plaining solar neutrino oscillations because it enhances only electron neutrino 
oscillations into the other two flavors. 

Another possibility is that gravity is responsible [265]. Neutrinos, instead 
of being massive, might couple differently to gravity relative to the manner in 
which baryons couple to gravity. According to the equivalence principle, all 
forms of mass-energy couple in the same way to gravity, so one is postulating 
here that the equivalence principle is violated in (at least) the neutrino sector 
of the Standard Model. If so, then there is a discrepancy between the weak 
eigenstates and the gravitational eigenstates of neutrinos, permitting neutrino 
oscillations. 

These two mechanisms can be distinguished from one another: the mass- 
mechanism has an oscillation probability which varies inversely with energy, 
whereas for the gravitational one the probability is proportional to the energy. 
So neutrino oscillation experiments act as a test of the equivalence principle 
[266]. The data from K2K have indicated that in the v,-v; sector mass 
oscillations appear to be the dominant mechanism [267], though both could 
be operative — perhaps neutrinos are massive particles that couple unequally 
to gravity! 
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Today all world experiments indicate the following for the neutrino mix- 
ing matrix, which has come to be called the Maki-Nakagawa-Sakata or MNS 
matrix [268]. It relates the mass eigenstates (labeled 1,2,3) to the weak 
eigenstates 


|ve) Ver Vez Ves l1) 
lvi) = Va Vi2 Via [v2) (25.28) 
|vz) Ver Vr2 Via |va) 


The MNS matrix has the same form as the CKM matrix (21.55), but with an 
additional feature. Writing cos vj; = cij and sin Ñi; = sij the MNS matrix 
is 


Va Veo Vea 
Via Vie Via (25.29) 
Va Veo Vez 
C12C13 —812C13 sige ^? gH 0 0 
= — 812023 — c128938130% C12C23 — s1259358130% $23C13 0 e20 
512823 — €12€23813€ —0€12823 — 5120235136? C23C13 0 0 1 


and we see that there is an additional diagonal phase matrix on the right. 
For quarks this phase matrix can be absorbed into redefinitions of the quark 
wavefunctions. However, it is possible that neutrinos are identical to their 
own antiparticles, in which case we call them Majorana particles. We have 
already observed antiquarks to be distinct particles from quarks, but we have 
no corresponding evidence that the same situation is true for neutrinos and 
antineutrinos. 

If neutrinos are their own antiparticles then the e/^* phases cannot be 
absorbed into a redefinition of neutrino wavefunctions. These phases influence 
neutrinoless double beta decay and other processes, but do not affect flavor 
oscillation of neutrinos. Each of aj, a2, and 6 contribute to CP violation — at 
present we have no empirical information as to the values of these parameters. 
Note that the size of CP violation in neutrino oscillation will depend on s13, 
since 6 always occurs in combination with this parameter. 

We've seen above that the characteristic time (or length) for conversion 
from one flavor into another in a 2-flavor model is inversely proportional to 
(Ame?)”. In a full 3-flavor model the same kind of relationship would hold, 
but with Amc? replaced with much more complicated expression depending 
on the neutrino masses. However, the atmospheric neutrino data are are very 
well described by the hypothesis that the oscillation is purely a 2-flavor one 
between v, and v,. Furthermore, the empirical value of 253 suggests that 
these are maximally mixed, i.e. the conversion between the two flavors is as 
large as possible. This greatly simplifies the interpretation of the data. The 
SNO and KamLAND data, in conjunction with atmospheric and reactor data, 
suggest that two neutrino mass eigenstates are significantly involved in solar 
neutrino evolution. SNO and KamLAND also establish that the mixing angle 
for this process is large. Finally, bounds on the short-distance oscillation of 


Beyond the Standard Model 499 


reactor electron-antineutrinos imply that VA < 0.032, which in turn means 
that sł, < 0.032. 

If we put together all current experimental information on neutrino oscilla- 
tions we find the following experimental values [1] 


Dia 200 = 33.9 2 Daz z Vum = 45^ +82 913 < 3.2% (25.30) 
Ams, Ami =8.01735 x 1075 ev? (25.31) 
2 a = 2409 x 1073 eV? (25.32) 


atm 


2 2 
Am, = Am3, = Am 


I've used the convention here that the numbers 1,2,3 represent the order of 
decreasing Ve content, and used notation to take into account that observed 
results indicate that essentially 0,2 is the “solar mixing angle” and 253 is the 
“atmospheric mixing angle." 

'The current situation is depicted in 25.5. This picture shows that two of 
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The currently understood possibilities for the mass spectrum of the weak 
neutrino eigenstates. Notice that the standard weak-flavor combinations sig- 
nificantly mix together for each mass eigenstate. 


the three mass eigenstates are close together in mass, and that there is a 3rd 
mass eigenstate that is somewhat heavier — or lighter! At this point in time 
we don't know which is the case. Unlike the quarks, each mass eigenstate has 
lots of mixing between the different flavors of neutrinos. For example, the 
middle-mass eigenstate on the left (or the top one on the right) is an almost 
equal superposition of all three neutrino flavors. 

There is another intriguing possibility suggested by the KARMEN/LSND 
results. If both of these experiments are correct then the best fit to the data 
implies that 


0.2 eV? Am? < 1 eV? (25.33) 


LSND 
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which is a huge mass splitting compared to the solar and atmospheric cases. 
However, if there are only three mass eigenstates we must have 


Am, + Am, + Am? — 0 (25.34) 


and it is not possible for Am?,,,,,Am?,, and Am?,,, to satisfy this equation. 
So if all experiments so far are yielding reliable results, there must be at least 
one more flavor of neutrino! Confirming this would definitely indicate that 
there is a lot of new physics beyond the Standard Model. 

Neutrino physics should yield some of the most interesting information 
about what lies beyond the Standard Model over the next few years. As 
experiments (such as LSND/KARMEN) are refined and as new observatories 
(such as SNOlab, the successor to SNO) collect data, we should learn much 
more about the behavior of the neutrino sector of the Standard Model. 


25.4 Axions and the Neutron Electric Dipole Moment 


Another possibility for finding new physics beyond the Standard Model con- 
sists of investigating the structure of the neutron. Recall from Chapter 6 that 
under time-reversal T, the magnetic dipole moment ji of a particle changes 
sign, but the electric dipole moment d does not. Since the only direction that 
a subatomic particle at rest “picks out" is the one given by its spin, dipole 
moments must either be aligned or antialigned with the spin direction. But 
under T the spin flips sign but d doesn't, violating time-reversal. Assum- 
ing that CPT is a symmetry, then this means that CP is violated for any 
elementary particle having a nonzero electric dipole moment, or EDM. 

If an elementary fermion had an electric dipole moment then there would 
be an additional vertex in the Standard Model as illustrated in fig. 25.6 where 
xr" = t (h", 3"]) and the constant ga is a coupling constant (having units 
of charge times length) parametrizing the strength of the EDM. Except for 
the factor of y?, this vertex looks just like that for the anomalous magnetic 
moment interaction induced by loop corrections in QED. 

This vertex violates both T and P and so it does not exist in QED. Purcell 
and Ramsey in 1950 [269] considered a rather unconventional model in which 
the EDM of the neutron respected P, which was thought to be a symmetry of 
the world at that time. They obtained the limit 


gs 10 e-cm (25.35) 


for the neutron. Once parity was found to be violated by the weak interactions 
it was still thought that the EDM of the neutron (if it had one) was T-invariant 
and thus also CPT-invariant by virtue of the CPT-theorem. However, when 
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FIGURE 25.6 
Vertex for a fermion with an electric dipole moment. 


CP-violation was observed in 1964, the subject of the EDM of the neutron 
became of particular importance in physics, since it afforded a new test of CP 
physics. 

QED is invariant under each of C, P, and T and as a consequence treats both 
right-handed and left-handed quarks on the same footing. It was originally 
thought that QCD respected the same symmetries since gluons coupled with 
the same strength to both right-handed and left-handed quarks. However, in 
the 1970s a subtlety in QCD was discovered that indicated it did not satisfy 
CP symmetry. The reason for this has to do with the vacuum (or ground 
state) structure of a non-abelian gauge theory. 

A gauge particle (like a photon or gluon) that is in a configuration that has 
vanishing field strength at large distance must be a pure gauge artifact. For 
the photon this just means that its wavefunction is A, — 0,0, where a is 
some function that vanishes sufficiently rapidly at large distance. Another 
gauge transformation can be made to set A, — 0 everywhere. This is clearly 
the lowest energy state of the photon, and is regarded as the vacuum state of 
QED. 

However, for gluons the situation is different. There are distinct non- 
vanishing configurations of the gluon wavefunction that are pure gauge at 
large distance. These configurations all have vanishing field strength at large 
distance but cannot be set to zero (or equal to each other) by another gauge 
transformation. Since each of these distinct configurations has a field strength 
that vanishes at large distance, each can be regarded as a possible vacuum 
state of QCD. 

What distinguishes these configurations? The answer has to do with their 
topology. Two field configurations of a gauge particle are said to be topolog- 
ically equivalent if there is a mathematical transformation that continuously 
changes one into the other without making the energy infinite in the process. 
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For example, suppose there were a theory in which there were two poten- 
tial energy minima for two different types of field strengths. If the barrier 
between these minima were infinite then there is no continuous transforma- 
tion we could make to change a field configuration whose energy was in one 
minimum to that in another minimum. In this case we would say the two 
configurations were topologically inequivalent, even though their minimum 
energy was the same. 

A helpful analogy consists of considering a rubber band. Suppose we have 
a pole of finite length. We could either wrap the rubber band once around the 
pole or not. Suppose the band is wrapped around the pole once. Since the pole 
is of finite length we can easily remove the band from the pole by lifting it off 
of one end. This is a continuous set of operations (a continuous mathematical 
transformation) and so the configuration with the band wrapped around the 
finite pole is topologically equivalent to the one where it is not around the 
pole. We can do this for any configuration of the band: whether it is wrapped 
once, twice, or many times (as long as the rubber can be stretched). 

Now suppose the pole has infinite length. A rubber band that is not 
wrapped around this pole is topologically distinct from one that is wrapped 
once around this pole, since the latter cannot be taken off of the pole by any 
continuous transformation. Since the pole is infinite we can't lift the band 
off. We could of course break the band and then glue it back together away 
from the pole, but this is not a continuous transformation since the band is 
no longer a band for part of this procedure. So we say in this case that 
the two configurations are topologically distinct. In fact, any configuration 
in which the band is wrapped around the pole a given number of times will 
be distinct from one where it is wrapped a different number of times. If we 
imagine having a band that won't break no matter how much it is twisted, 
then there will be a countably infinite number of different configurations that 
are topologically distinct. We can label these by the number of times the band 
is wrapped around the infinite pole — a quantity called the winding number. 

The situation with gluons in QCD is similar to that of the rubber band 
and the infinite pole. In QCD the gauge group is SU(3) and the gluon can 
be in a configuration that at large distance has vanishing field strength and 
so approaches a pure SU(3) gauge transformation. However, this transfor- 
mation can't be set to zero the way that we can for QED because the gauge 
configuration is ^wrapped around" all of space analogous to the way that 
the rubber band is wrapped around the infinite pole. The configuration can 
be unwrapped (zero gluon field strength everywhere), wrapped once, twice, 
or many times around all of space*. The different configurations are distin- 


tTThese wrappings are described mathematically by something called a homotopy group. 
The homotopy group in this case is 73 (SU(3)) — Z, which means that there are countably 
infinitely many topologically distinct maps from the group SU(3) to the three-dimensional 
sphere, each map labeled by an integer. These distinct maps are what I am calling “wrap- 
pings". 
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guished by a winding number. Since each of these “wrapped” configurations 
has vanishing field strength at infinity, they qualify as possible vacuum states 
(or ground states) of QCD. These configurations all violate CP, except for the 
configuration where the winding number is zero. 

Which one of these is the “correct” ground state? You might think that we 
should just choose the one with zero winding number, but this is wrong be- 
cause of a subtle non-perturbative quantum effect. Here is where the quarks 
enter the picture. Recall that in order to get sensible quark masses, we had to 
separately transform the set of right-handed and left-handed quark wavefunc- 
tions to ensure that the Yukawa coupling matrix to the Higgs wavefunction 
was diagonal with all elements real. This can be done, of course, but one 
transformation required to ensure this involves simultaneously transforming 
all left-handed quark wavefunctions with one phase and all right-handed quark 
wavefunctions with the opposite phase: 


V^, = eir Vs = e Pg (25.36) 


'There is a non-perturbative quantum effect that induces from this transforma- 
tion one of the wrapped configurations of the gluon vacuum that has nonzero 
winding number. So even if we started with a vacuum of zero winding num- 
ber, this quantum effect would give us a vacuum that had a non-zero winding 
number once we made the above transformation. There is a non-zero proba- 
bility to tunnel from a vacuum with zero winding number to one with nonzero 
winding number. 

So why not include all the winding numbers, and say that the actual vac- 
uum of QCD is some linear combination of them all? In fact, this is what is 
(and should be) done. Since each wrapping is labeled by a winding number 
n, its quantum state can be denoted by |n). Each vacuum state is separated 
from those with a different winding numbers by an energy barrier. Using 
methods beyond the scope of this text [270], it is possible to show that quan- 
tum mechanical transitions can take place between these different vacua, with 
amplitudes 


(m| e~*#* In) o exp a (25.37) 


which is a non-perturbative effect (there is no good series expansion of the 
right-hand side of eq. (25.37) for small g,). The amplitude is small for small 
strong coupling gs, and large for large gs. Since we have tunneling between 
the different vacua, the true vacuum state must be a linear superposition of 
all of them. As with periodic potentials in quantum mechanics (whose true 
ground state is the Bloch wave), we define the true vacuum to be 


l) =X e n) (25.38) 


and it is possible to show that this vacuum is invariant under all possible 
SU(3) gauge transformations. However, it is not CP-invariant. 
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The parameter 0 labels the different possible physically inequivalent sectors 
of QCD. We could choose any value of 0 that we want and compute physical 
processes in a gauge-invariant manner. Unlike the vacuum states |n), which 
can interact with one another via quantum tunneling, each |0)-world cannot 
communicate with any other |0)-world. Consequently 0 becomes another con- 
stant of nature (telling us what vacuum we live in), one of the undetermined 
parameters of the Standard Model. 

Can we determine 0 by experiment? Yes — this is again where the quarks 
come in. We can make another transformation of the form in eq. (25.36) 
on the quark wavefunctions to eliminate 0 entirely, setting it equal to zero. 
This sounds good, except that now our quark mass terms have a complex 
phase that induces the CP-violating vertex shown in fig. 25.6. We can further 
redefine the phases of all of the quark wavefunctions to get rid of this phase 
everywhere in the equations of motion except for the mass term of one quark, 
which we might as well take to be the up quark (though any quark would 
do). This phase is of the form exp (i9/N,) where Ny is the number of quark 
flavors. Hence if one quark mass were zero, there would be no CP-violation 
of this type in QCD. 

However, experiment tells us that all quark masses appear to be nonzero. 
'The net effect of this phase is to induce the vertex shown in fig. 25.6 for the 
up quark only. We could go to some trouble to compute the amplitude for 
the neutron, by constructing an effective neutron vertex for the electric dipole 
moment, but we don't need to in order to estimate how large this effect is. 
The coupling ga must be proportional to |0| and the mass m, and charge 
of the up-quark. This has units of (charge) x(mass), but on dimensional 
grounds the dipole moment must have units of charge times length, which 
is the same as charge/mass, since the Compton wavelength of a particle is 
inversely proportional to its mass. Hence we need to divide by the square of 
a mass. The only mass scale for the neutron is the mass of the neutron Mn, 
so on dimensional grounds 


gan A 9| e = 10716 ll e-cm (25.39) 


a result that is supported by more detailed calculations. The best experi- 
mental limits today imply [1] 


[ga] < 107% ecm (25.40) 


which is very tiny, and sets a bound of |6| < 107°. 

'This suggests that perhaps 0 — 0, and that the actual vacuum of our world 
really does conserve CP after all. The problem with this is that we have 
no mechanism within the Standard Model for setting 0 — 0 any more than 
we have a mechanism for setting one of the parameters of the CKM matrix 
to a particular value. However, the near-vanishing value of 0 is suggestive 
that perhaps some dynamical mechanism is at work to naturally make 0 = 0. 
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Peccei and Quinn in 1977 proposed such a mechanism [271] in which 0 became 
a dynamical variable that relaxed to the minimum of an effective potential 
where C and P were both conserved. This mechanism requires the existence 
of a new pseudoscalar particle called the axion. 

The original axion from the Peccei-Quinn model was ruled out by exper- 
iment, but new versions of the model appeared in which the axion couples 
weakly enough to ordinary matter to have thus far escaped detection [272]. 
The common feature of all such models is that there is some U(1) symmetry 
that is spontaneously broken at some high energy and is slightly broken by 
the non-trivial QCD vacuum, providing a small mass for the axion. 

If there is an axion, it will most likely be observed as result of its interactions 
with the light quarks, or in other words with the low-mass hadrons. However, 
because axions couple to gluons, they indirectly couple to pions. The most 
general model for coupling the axion to up and down quarks leads to a theory 
(that I won't go into detail about here) that describes interaction of the axion 
with pions. To leading order in the pion decay constant fr, this theory leads 
to the result [273] 


f. mum 6 x 1073 (MeV)? 
A Es 25.41 
Maxion fa (ma + ma) m fa ( ) 


where fa is the “axion decay constant” with units of energy that parametrizes 
the energy scale of the spontaneous breaking of the U(1) symmetry. 

The interactions that axions can have with fermions would modify predicted 
cross-sections for a whole range of accelerator and reactor experiments. By 
setting fa to be large enough we can easily make these modifications tiny 
enough to have escaped detection so far. Since axions couple to pions they 
also couple to photons. Stars can therefore “cool off’ by emitting axions, 
and limits on the rate of stellar cooling of red giant stars force fa > 10" GeV. 
A stronger limit [274] comes from the supernova SN 1987A: the observed 
neutrino pulse from confirmed the theoretically expected cooling speed young 
neutron stars to be a few seconds, which means that excessive cooling by 
axions cannot take place, forcing f, > 10'°GeV. 

This last limit means that axions are very light — about 6 x 10? eV. You 
might imagine that searching for such a low mass particle in the lab would be 
very challenging. The most promising approaches exploit the fact that axions 
and photons will convert into each other in the presence of an external electric 
or magnetic field. An external magnetic field will cause axion-photon oscil- 
lation in a manner similar to neutrino-flavor oscillations, whereas the electric 
field due to a charged particle (like a nucleus) will make the conversion an 
effective scattering process of the form y+Ze — a+ Ze, a phenomenon called 
the Primakoff effect [275]. If a photon is moving in a direction perpendicular 
to an external magnetic field it will convert into axions moving in the same 
direction as the beam [276]. This phenomenon can be searched for by sending 
a laser beam down the bore of a two long superconducting dipole magnets 
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(like the bending magnets in high-energy accelerators). Axions will be gener- 
ated in the first magnet provided the oscillation length is comparable to the 
length of the magnet. Putting a barrier to the laser beam in between the two 
magnets will cause only axions (if there are any) to move through the second 
magnet, which will convert them to photons that can then be detected. So far 
such searches have yielded null results [277], and have not yet placed limits on 
axion couplings that are better than what we have from stellar observations. 

It is also possible to get an upper bound on fa from cosmological consid- 
erations [278]. Axions will be produced in the early universe, contributing to 
the overall energy density of the universe. However, the axion energy den- 
sity doesn't rapidly dissipate into other particles because axions couple very 
weakly to known matter. This "invisible" energy density will exceed the 
amount that would cause the universe to be closed (contrary to observation) 
unless fa < 10!2GeV. So perhaps there really are axions out there! 


E: oo RA 


25.5 Frontiers 


Neutrino physics and axions represent only a small portion of the effort being 
expended today to understand what lies beyond the Standard Model. In 
the remaining part of this chapter I will sketch out a few of the many other 
approaches being taken to extend the frontiers of our knowledge. 


25.5.1 Dark Matter 


Axions and neutrinos both form examples of what might be called “dark 
matter” — matter that interacts so feebly with known matter that its presence 
in the universe on cosmological scales can almost certainly be detected only 
via its gravitational interactions with known matter. The latter half of the 
20th century led to the construction of a standard cosomological model of the 
universe that describes all known cosmological data in a single framework. 
The total mass-energy of the universe is now known to be within about 1% 
of the critical value needed for the universe to be closed and finite (the so- 
called critical density) [1]. Within the context of this model it is known that 
only 4% of this consists of known matter, which is inferred from the relative 
abundance of hydrogen, helium, and other light nuclei. Most of this — 3.6% 
of the total — is in the form of interstellar gas, with the remaining 0.4% in the 
form of stars and planets. 

Another 23% of the energy density is dark matter — a form of mass-energy 
whose presence is known only via its gravitational attraction to other luminous 
(and known) matter. This kind of matter neither emits nor absorbs photons 
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(or if it does, it does so at a rate so feeble as to have so far escaped detection). 
'The composition of this matter is an outstanding puzzle for both particle 
physics and cosmology. It was once thought that neutrinos could be the dark 
matter, but it is now known that they contribute at best 196 to this amount. 
It is generally thought that dark matter is some new kind of particle not 
described by the Standard Model. In view of the limits above, axions are a 
viable candidate today, but there are several other possibilities. A new kind 
of "sterile neutrino" — one that can convert into known neutrinos but does not 
participate in the weak interactions — is another possibility. Yet another is 
a WIMP, short for Weakly Interacting Massive Particle. WIMPs experience 
weak interactions, but have escaped detection in accelerator experiments so 
far due to their presumed large mass, which is perhaps 10s or 100s of GeV. 
Supersymmetry (see below) provides a theoretical framework for describing 
such particles [280]. 

Trillions of WIMPs must be passing through the Earth each second if they 
indeed exist. Experimental searches for WIMPs are ongoing. Some attempt 
to directly detect them with a laboratory detector, whereas other experiments 
search for indirectly for WIMPs via the products of their decays and anni- 
hilations. Some experiments have claimed detection (for example EGRET 
[281], CDMS [282] and DAMA/Nal [283]) but without confirmation, and oth- 
ers have found only null results. A number of experiments are currently in 
progress, and still others are in development. The LHC counts as one of these 
— it could detect WIMPs as a form of missing energy and momentum provided 
their coupling to ordinary matter were sufficiently strong and their masses not 
too large [284]. 


25.5.2 Dark Energy 


Normal matter — including dark matter — is gravitationally attractive. It there- 
fore imposes a deceleration on the expansion of the universe, slowing down the 
expansion in a manner analogous to the way the earth's gravity slows down 
a projectile launched from its surface. An outstanding goal in cosmology for 
a number of years was to measure the amount of this deceleration. In 1999 
this goal was achieved by looking at supernova data. Supernovae of type la 
can be used as reliable distance indicators (standard candles) in astronomy 
because their absolute luminosity is known. They are visible at very large 
distances, and so can be used to infer the deceleration rate. Distant super- 
novae should appear brighter and closer than their redshifts would otherwise 
suggest. 

When the measurements were completed, most supernovae were found 
to be dimmer and further away than expected. In other words instead of 
decelerating, the universe was accelerating in its expansion! This means that 
there must be some other form of matter with a kind of gravitationally repul- 
sive character, and it must be in enough abundance to overcome the attractive 
decelerating effects of the known and dark matter. Further measurements in- 
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dicate that the remaining 73% of the energy density of the universe is of this 
type: it is a very peculiar form of matter that has negative pressure. 

'The simplest explanation of this form of matter is that it is a relic vacuum 
energy, parameterized by what is called a cosmological constant [285]. This 
means that the dark energy is the same at all times and places. This introduces 
yet another parameter into the Standard Model as noted earlier. However, this 
parameter is very problematic for physics. All subatomic particles contribute 
to the vacuum energy density, and an estimate based on known methods 
from quantum field theory estimate a value for this vacuum energy density 
that is 10120 times the amount that is observed. This is perhaps the largest 
discrepancy between theory and experiment known in science [15]. 

As you might have guessed, many physicists have attempted to provide some 
other explanation for dark energy by making it depend on space and time in 
some way. These approaches typically suggest the existence of new particles. 
Others have attempted to modify General Relativity, our established theory of 
gravity, at large distances and/or over long cosmological times. On general 
grounds there is no reason at this point in time to prefer a cosmological 
constant over any of these other models, and the question of the composition 
of dark energy is wide open [8]. At present, cosmologists and astronomers 
are in the process of mapping out the historical development of our universe 
to see if the dark energy is indeed constant by measuring thousands of more 
supernovae and the distribution of mass on very large scales. 


25.5.3 Grand Unification 


'The fact that weak and electromagnetic interactions can be unified into a sin- 
gle theory suggests that the strong interactions should somehow be included 
as well. Such theories were called “Grand Unified Theories”, or GUTs, since 
they would unify all non-gravitational forces into one grand theoretical frame- 
work, with one coupling constant governing them all. 

How might this work? We've seen in chapters 14, 18 and 23 that each of 
the coupling constants is a function of energy. The electromagnetic coupling 
ge grows as a function of energy whereas the other two decrease. So it is 
not unreasonable to imagine that they might all have the same value at some 
energy. A rough estimate puts this energy scale at 1016 GeV — the grand 
unification scale. 

The simplest GUT was a model with SU(5) as the gauge symmetry group 
[286]. It’s simplest in the sense that this is the group with the smallest num- 
ber of generators that can include the groups of the strong and electroweak 
theories. All of the quarks and leptons in a given generation were assigned to 
be in irreps of this group. There are fifteen such particles: the right- and left- 
handed electron, the electron-neutrino, and up and down quarks of 3 colors 
each, each right and left handed. The smallest two irreps — the quintet and 
the decuplet of SU(5) — were used: three left-handed charge-conjugate down 
quarks, the left handed electron and the electron-neutrino were placed in the 
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quintet — so they would be regarded as one kind of “particle,” and the rest 
(charge-conjugated as needed) were placed in the decuplet, the other kind of 
“particle.” That all the particles did not fit into a single irrep was regarded 
as somewhat unattractive, and a different GUT, based on the group SO(10) 
allows all known particles (plus the right-handed neutrino) to fit into a single 
irrep, the 16, of this group. 

Some kind of Higgs mechanism is presumed to break the SU(5) symmetry 
down into the SU(3) x SU(2) x U(1) symmetry of the Standard Model (with 
the SU(2) x U(1) in turn broken by the Higgs mechanism we looked at in 
chapters 22 and 23). But in the absence of symmetry breaking all the leptons 
and quarks in a given generation have the same mass, and interact identically, 
attracting/repelling one another under a single force, which in SU(5) would 
be governed by the exchange of what are called leptoquark bosons, a general- 
ization of gluons. There are 24 of them. Twelve are really just the photon, 
W=, Z and gluons in disguise; the rest are new bosons that change leptons 
into (anti)quarks and vice versa — hence the name, analogous to the way that 
a W boson changes an electron into its neutrino partner, or a gluon changes 
a blue quark into a green one. Leptoquark bosons come in two types (.X with 
charge +4 and Y with charge +4, along with their antiparticles) and couple 
antiquarks to leptons and quarks to antiquarks; some examples of vertices 
in the SU(5) model appear in figure 25.7. Under symmetry breaking these 
leptoquark bosons become very massive, about 1015 GeV. 

'This is very heavy - but not infinitely heavy. Every now and then a down 
quark could emit one of these objects and change into a positron. The lep- 
toquark boson can then collide with one of the remaining up quarks in the 
proton, changing it into an anti-up quark. This in turn binds with the other 
up quark to produce a 7°. These kinds of theories therefore predict that 
baryons are unstable, and that the proton can decay via the process 


p—> et +r’ (25.42) 


as shown in figure 25.8, albeit very slowly since the leptoquark boson is so 
heavy. 

The original estimates for this process put a bound on the range of lifetime 
of the proton [287] as 10?! > 7, > 10% years, twenty orders of magnitude 
longer than the age of the universe! This sounds nigh impossible to detect, 
but if a sufficiently large number of protons are collected together — say about 
10%, roughly the amount in a swimming pool of water — then about 1 per 
year should decay. Since the decay channel (25.42) is so unique, there is a 
chance of actually observing this process. 

Well, people looked hard and carefully over a series of increasingly precise 
searches, but didn't see any evidence for proton instability [288]. The current 
lower bound is [1] 

Tp > 10% years (25.43) 


ruling out the original SU(5) model and putting constraints on all GUTs. 
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FIGURE 25.7 
Examples of leptoquark boson vertices in SU(5). The vertex rules carry with 
them appropriate combinations of color to ensure color conservation. 


The reluctance of the proton to decay as hoped for put a damper on GUT 
models. To make matters worse, the three coupling constant were also found 
to not actually coalesce at a single value of the energy. This was due to pre- 
cision measurements at LEP. Prior to these measurements there was enough 
uncertainity in the values of the constants that within limits of error they all 
had the same value at an energy of 6 x 1014 GeV (which led to the prediction 
in eq. (25.42) for the proton lifetime). However, it is now known that the 
three constants do not meet at a single point in the original SU(5) model 
[289] — see figure 25.10. 

So if there is to be a Grand Unified Theory, it must include some kind of 
new physics beyond the leptoquark bosons that must appear in such a scheme. 
Either there must be new (unstable) particles with masses in between the 
GUT-scale of 101% GeV and the electroweak scale of 100 GeV, or new gravity 
effects, or new energy scales less than the GUT scale at which the GUT 
symmetry (whatever it is) is broken. 


25.5.4 Supersymmetry and Superstrings 


A vexing problem of the Standard Model (known as the hierarchy problem) 
is connected with the Higgs particle. The mass of any subatomic particle will 
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FIGURE 25.8 
A possible means for the proton decay process in eq. (25.42) to take place in 
SU(5). 


be corrected by loops that consist of the fermions and gauge bosons from the 
Standard Model that couple to that particle. These corrections are generally 
under control for pretty much all of the particles in the Standard Model 
except for the Higgs particle, where the loop corrections to the mass tend to 
induce corrections that are about 10!6GeV in the context of any Grand Unified 
Theory. It’s possible to adjust parameters to get this value to be between 100 
and 250 GeV, but only by a very delicate cancellation to 13 decimal place 
precision, and this is just at one loop order. A two-loop correction to the 
Higgs mass would entail cancellation to 26 decimal place precision. What 
would cause such cancellations? 


One possible explanation — favored by a very large number of theorists — is 
that supersymmetry is responsible [290]. Recall that fermion loops contribute 
a minus sign whenever they appear. A loop with bosons contributes positively. 
This suggests that we could impose a new symmetry on a given theory, one 
in which every fermion loop is cancelled by a corresponding boson loop. 'This 
will only work if the boson in the loop has exactly the same properties as 
the fermion except for spin. A supersymmetric theory is one in which each 
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fermion has a corresponding boson partner with identical properties (mass, 
charge, etc.) except for spin. It is possible to formally transform the fermion 
wavefunction into its boson partner and vice versa — such a transformation is 
called a supersymmetry transformation. From the perspective of supersym- 
metry, these fermion and boson particles are really just two different states of 
one “superparticle,” analogous to the way that the electron and its neutrino 
are different states of one weak doublet. 


Superstring theories go a step further [11]. In such theories particles are 
regarded not as pointlike, but as line-like, or as a string. The string can be 
open — like a shoelace — or closed, like a rubber band, as illustrated in figure 
25.9. The idea is that the most fundamental elementary particle in nature 
is a string, and the known particles we see — the quarks, the leptons, the 
photon, the weak bosons, the gluon, even the graviton — are different kinds 
of vibrations of this fundamental object. It can be shown that these kinds of 
theories predict states of every possible allowed spin. One of these is a spin 
3/2 particle, referred to as the gravitino since it transforms with the graviton 
under supersymmetry. Many theorists view the string approach as attractive 
because it incorporates gravity into the theoretical scheme that describes the 
interactions of all of the other particles. String theory is presumed to be 


T Shorter 
I Distance 
M 
E 
point particle open string closed string 
FIGURE 25.9 


In string theory, point particles are large-distance approximations to strings. 


valid at very high energies, energies at which quantum gravity is expected 
to become important. This is higher than the GUT scale, so high that the 
the Compton wavelength of a particle of mass M becomes comparable to its 
Schwarzschild radius 2G M/c?, the radius of a black hole of the same mass. 
This energy scale is called the Planck scale, the scale at which all particle 
interactions are effectively indistinguishable from one another, and at which 
perhaps even our standard concepts of time and space break down. Some 
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kind of symmetry breaking mechanism is presumed to take place that reduces 
the theory to general relativity and a Grand Unified Theory of some kind at 
lower energies. 

It is clear that none of the known particles in the Standard Model can be 
the superpartners of any other particle since their properties are all so very 
different. Hence if supersymmetry is valid there must be at least twice as many 
particles as observed, since each known particle will have a superpartner. For 
example the electron will have a partner called the “selectron” that will have 
the same charge and mass as the electron but zero spin. This symmetry 
will ensure that the corrections to the Higgs mass are zero, since every loop 
contributing positively from a boson will be cancelled by one from its fermion 
superpartner. 

So where are these superpartners? None have yet been observed, and it 
is easy to see why. If a selectron really existed with the above properties 
it would bind to nuclei to form atoms. Since the selectron is a boson, the 
Pauli principle is not operative — many selectrons would Bose-condense in 
atoms, causing normal matter to implode. Supersymmetry therefore must be 
a broken symmetry if it is to play a role in the description of particle physics. 
A symmetry-breaking mechanism must be in place that ensures all unobserved 
superpartners gain masses that are too large to have been detected so far. This 
effect will spoil the perfect loop cancellations that correct the Higgs mass, but 
if the symmetry breaking scale is small enough (say about 1 TeV) then the 
imbalance between the boson and fermion loops can be kept small enough to 
ensure the corrections to the mass of the Higgs are not too large. 

This leads to a very attractive feature of supersymmetry — it restores the 
unification of the forces at high energy! Provided supersymmetry is broken 
at about 1 TeV, all three coupling constants become the same at GUT scale 
energies. The actual calculations from the running coupling constants in the 
minimal supersymmetric extension of the Standard Model give [289] 


Me = 1084+13 GeV 
Maus = 10/5659 ew (25.44) 
cl = 26.3 + 2.9 


in order to ensure unification, as illustrated in figure 25.10. The coupling 
constants are 


5 e Q = Q p< (gw) Q = Q E Is 
$ d Am hc à °  4mhic 


den 3 Anfic 
where the factor of 5/3 in the electromagnetic case is a convention chosen to 
normalize its generator relative to those of the strong and electroweak gauge 
groups. 

The numbers in eq. (25.45) are tantalizing, given that the LHC will ex- 
plore the TeV energy scale. Yet the mechanism for supersymmetry breaking 
is an outstanding problem. The simplest supersymmetric generalization to 
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FIGURE 25.10 

Running of the electromagnetic, weak, and strong coupling constants in the 
Standard Model (left), and in its minimal Supersymmetric (SUSY) extension 
(right). The thickness of the lines is indicative of the errors. 


the Standard Model has over 125 adjustable parameters, and there is neither 
a compelling approach to breaking supersymmetry nor a clear way to pre- 
dict these parameters. However, all supersymmetric theories have the feature 
that the superpartners cannot all decay into known particles. There must 
be a lightest supersymmetric particle (LSP) whose presence has so far gone 
undetected. This particle could perhaps be a dark matter WIMP referred to 
above. 

One of the major hopes of the LHC is to find some of these superpartners. If 
none are seen, new bounds on supersymmetry breaking and its role in physics 
will be established. Conversely, if definitive evidence for superpartners is 
obtained then our picture of particle physics will be revolutionized. 


25.6 Summing Up 


There are many new ideas for extensions to the Standard Model that are being 
explored in various laboratories around the world, and I have mentioned only 
a few of the most popular ideas here. Keep in mind that there are lots of other 
ideas out there for both theoretical generalizations and experimental study, 
and there is a need to generate more of each. Perhaps you will someday 
contribute in the quest to find the fundamental theory of physics. 


Beyond the Standard Model 515 


25.7 Questions 


1. (a) Consider neutrino oscillations in the full 3-flavor case. Show that 
the probability for a neutrino flavor Va to change into a flavor vg in time 
t is given by 


P (Va > vg) = 


mt 
2 VarVgr exp (35) | 


using the same approximations as for the 2-flavor case, where E is energy 
of the neutrino. 


b) Suppose now that we assume Am2, < Am24 ~ Am?,. Show that 
21 23 31 


Amst 
P (ve > wv) = sin? 55 sin? 2013 sin? ( S ) 


Amit 
P (Ve = vr) = cos? 023 sin? 2015 sin? (5728) 


Amat 
P (v, > Vr) = cos? 013 sin? 2913 sin? ( 11133 ) 


where the 9;; are the parameters of the MNS matrix. 
2. Assuming 213 is small, find P (v, — ve). 
3. Consider writing the CKM matrix in the same form as the MNS matrix: 


Vua Vus Vub 
Vog Ves Von 
Via Vis Vib 
C12C13 — 812013 sige 
= | 812023 — c128235130% C12C23 — 8128235136? $23C13 
$12823 — €12€23513€? —C12823 — 5120235136? C23C13 


as in eq. (21.55). 
OfE diagonal elements of the CKM matrix become increasingly smaller 
as the generation number increases. Suppose we define 


$12 = A $23 = AX sıze = AX? (p = in) 


and consider A to be a small parameter. 


(a) Find the approximate form of the CKM matrix, retaining in each 
entry the relevant power of A. 


(b) Why isn't this approximation valid for the MNS matrix? 
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. A “large” gauge transformation U acts on the vacuum |n) as follows 


Un) = [n +1) 


Show that the |0)-vacuum is eigenstate of U. 


. Show that the |0)-vacuum is unique. In other words, show that the 


amplitude for a transition from |0) to |6’) is zero unless 0 = 6’, where 
H is the Hamiltonian of the system. 


. Find two other diagrams that can yield the decay process 


p—>et +n’ 


in SU(5). 


. Find the energy at which the Compton wavelength of a particle of mass 


M is equal to its Schwarzschild radius. How much larger is it than the 
GUT scale? How large is it in kilograms? Convert it to units of time 
(called the Planck time) and distance (called the Planck length). 


A 


Notation and Conventions 


A.1 Natural Units 


'The fundamental constants of nature that appear in all studies of particle 
physics are Planck’s constant 7; and the speed of light c. These have values 


h = 1.05457266 x 10 ?* Js (A.1) 
c = 2.99792458 x 10% ns! (A.2) 
which are rather cumbersome to use. It is much more convenient to work in 
natural units* where h = x = l and c = 1. This also allows one to set the 


permittivity of free space, €o = 1, provided all charges are rescaled in units of 
(ic) 1/ ? Twill typically adopt these conventions, except on occasions where 
it is useful to illustrate the explicit units. This will typically be when I display 
a result that can be directly compared to experiment (such as a decay rate or 
a cross-section), in which case the factors of fi and c are useful. 

The easiest way to work with this in particle physics is to (a) express all 
velocities as a fraction of the speed of light and all times in terms of the 
light-travel distance (b) convert distances into units of inverse energy (or vice 
versa) as appropriate, using the conversion factor hc = 197 MeV-fm, and (c) 
express charges, masses and momenta in units of energy. 

So we use the following prescriptions: 


Physical Quantity Notation Units | Natural Physical 


velocity B unitless 8 — v/c 
time t h/MeV t t/h 
length d hc/MeV d — d/hc 
mass m MeV/c? m — mc? 
momentum p MeV /c p pc 
charge q unitless q — q/vhe 
energy E MeV 


*The reason why they are called natural units is because their definition is a consquence 
of the properties of nature rather than any human construct. It is common to include 
Newton's constant of gravity G, setting G — 1 in which case natural units are often called 
Planck units. 
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So in other words, if we see a given expression that depends on mass, time, 
momentum and energy, to convert it to standard units we apply the conversion 
factors in the right-hand column. I have given a set of examples in table A.1. 

Units of charge follow differing conventions. Particle physicists like to use 
Heaviside-Lorentz units, and I shall adopt these here. This means that the 
fine-structure constant is 


e 


TAI Arhe 
though I will typically use natural units and set A = c = 1. 


Coulomb's Law System Units 
F= TEES SI Coulombs 
F=f? Gaussian Electrostatic units (esu) 
P= 2S Heaviside-Lorentz Tz Electrostatic units (esu) 


A.2 Relativistic Notation 


A 4-vector a, will generally be noted in component form as a, = (ao, 7), 
where @ is a 3-vector. Repeated indices are almost always summed over all 
of their values — when this is not the case it will be explicitly indicated. So 
for example 


3 
A,: BY = M (A, B") = Ap B® + A,B! + A,B? + AB? 
u=0 


The reason for this convention is that the cumbersome summation symbols can 
be suppressed in writing expressions and doing calculations. I will use Greek 
letters (a, B, y, ...) to denote the components of 4-vectors (and 4-tensors), and 
Latin letters (a, b,c, ...) to denote components of 3-vectors. 


A.2.1 Metric 


A very important quantity is the metric g,,, which has the associated line 
element (or interval) 


ds? = g,, da" da" = (da?) — dz - dz = (da?)^ — (de)? — (dx?) — (da?) 


519 


a as id 
S unra) eune) 


A4) x41- —27 


aui = az E 
OA, 
us gue gu 
uomo SU po me 
UL 


uorssoIdxy prepureys 


cup ASI9UH quio[noj) 
A X= — 7T 103819d() umjuouroJN Iepn3uy 
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where in matrix form the metric is 


+1 
—1 
Suv = 7 
—1 
and its inverse is 
+1 
—1 
m 
I = -1 
—1 
which means that 
1 
1 
Iw 9” = guug^" = ô; = 1 


1 


and 69 is the relativistic Kronecker-delta function. From this perspective, the 
metric generalizes the notion of the 3-dimensional Kronecker-delta function 
i; to special relativity. It is the object that tells us how to measure distances 
and time intervals, and takes the dot-product between vectors. For any 4- 
vector we define 


à, : covariant a" : contravariant (A.3) 


a” = ga, = (a0, -T ) > a, = (ag, @) = Juva” (A.4) 


and the relativistic version of the dot-product is 


a-b= g,,0"b" = atb, = a,b" = g" a,b, = aobo — d «b 


In general the magnitude of a 4-vector is the dot-product of the vector 
with itself 


(4-vector)? = (timepart)? — (3-vector)? (A.5) 
2 


a? = a^a, = a2 - |T|? (A.6) 


'The negative sign means that a given 4-vector can be one of three possible 
types: 

Spacelike a? <0 

Timelike a? >0 

Null a? =O 
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A.2.2 Momentum and Energy 


The most important 4-vector we will be using is the 4-Momentum p, — 
(E, P), which has the following property 


? (where m is a constant) (A.7) 


p:psg"p,p,—-E^— g.y-m 
Or E? — |p ^ +m? (or, converting to standard (A.8) 
notation, E? = |p|? c + (m)? ) 
where we interpret E = po as the energy of a particle with 4-momentum p, 
and mass m. Depending on the physical situation, a 4-momentum in a given 
process can have any one of the possible norms: 


Spacelike p? <0 (e.g. for a mediator in a scattering process) 
Timelike p?>0 (e.g. (rest mass)? of a physical particle) 
Null p=0 (eg. a physical photon) 


A.2.3 Lorentz Transformations 
A Lorentz-transformation is 
a, =A," ay a SAP oP (A.9) 


a, V! = ap BY > GPA HAG” = gh > ATLAS = OX = ALAS 


The matrix A performs rotations and boosts: 


(1) Boosts (y = 1/4/1 — 8?) 


y —f400 E' = y(E — fps) 
: -6y y 00 p, = (Pe — BE) 
alo -axis: NE = x 
ng z-axi v 0 0 10 Py = Py 
0 0 01 P, = Pz 
7 0 00 E' = (E — Bp;) 
. 0 1 -670 Dp = Px 
along y-axis: AE = i 
By v 0-By y 0 Py = Y(p, — BE) 
0 0 0 1 Py = Py 
y 00 —By E' = 7(E — fp) 
E Ho oL " 
along z-axis: A", 0 01 0 xad P, = Py 
-fy00 y p, = Y(p, — BE) 
(2) Rotations 
10 0 0 E-E 
: 01 0 0 Pl. = Po 
"IT Bou E 
about x-axis: A" 00 «cos D^ xin d E p, = cos Ü p, + sin 0 p; 


0 0 —sinó cos 0 p, = — sin 0 p, + cos 0 p; 
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100 0 E-E 
Os E coss 
nicus Re pe 0 cos0 0 — sin 0 — d Ps d Px — sin 0 pz 
4 001 0 Py = Py 
0 sin? 0 cosé p. = sind p, + cos 0 pz 
1 0 0 0 E=E 
. 0 cos@ sin 0 pl, = cos0 p, + sind p 
6 TI H = x T y 
OS At 0—sin8cos80 | ^ Py = —sin0 p, + cos0 py 


From the above, we see that A is the generalization of a rotation matrix to 
special relativity. It obeys the following properties 


( 
e (A1)*,, (A2) 4 = (iz)? 4. (2 Lorentz-tmfs — Lorentz tmf) 
( 


A.3 Greek Alphabet 


TABLE A.2 

Greek Alphabet 
alpha a A nu V N 
beta B B omicron o O 
gamma y DL xi € X 
delta ô A pi T II 
epsilon &e E rho po P 
zeta ¢ Z sigma cs Y 
eta 7) H tau T T 
theta 0,0 © upsilon v Y 
iota L I phi bp 9 
kappa nx K chi X = 
lambda A A psi Y V 
mu " M omega w,w Q 


B 


Kronecker Delta and Levi- Civita Symbols 


B.1 Kronecker Delta 


The identity matrix in any N-dimensional space is denoted in index form as 
67, where I, J, — 1,..., N. It is defined as 


jJ [0 IzZJ >. 
v= jJ = diag(1, 1, 1,..., 1) 


or in other words as a matrix with 1’s along the diagonal and zeroes everywhere 
else. The trace of an N x N matrix Ay, is 


Tr[A] = Ary! = AM 577 = Ajó), 


The indices on 97 can be raised or lowered without penalty except in the 
case of relativistic spacetime, where the metric raises and lowers indices. In 
general, the indices in the Kronecker delta function take on the values and 
notation of the indices in the space under consideration. 

For example, in spacetime the indices of the Kronecker-delta function would 
be Greek indices running over the values 0,1,2,3, and we have 


1000 


0100 
o ÓN HO __ Om __ 
ô, = Quvg = gvug = 0010 


0001 


Here only 6° has meaning; the quantities 6” and 6,, are meaningless. 
In three-dimensional space the Kronecker-delta function is denoted 6;; where 


o l 100 
dij = 09 = ô; = 010 
001 


and here the indices can be raised lowered as is convenient. 
In the two-dimensional space of Pauli matrices 


— sab sa . 10 
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where a,b have the values 1,2. 
In the four-dimensional space of Dirac matrices 


1000 
0100 
0010 
0001 


bored 


where a,b have the values 1,2,3,4. The range and labeling of the indices is 
context-dependent. 


B.2 Levi-Civita Symbol 


The N-dimensional Levi-Civita symbol (or epsilon-tensor) is denoted € y, ... Jy, 
where Jj, J5,... Jy = 1,...,N. It has as many indices as the dimension of 
the space in which it is defined, and is defined by 


+1 if J,,..., Jw is an even permutation of 1,2,3,...,N 
E In = $ —1 if J1,..., Jv is an odd permutation of 1,2,3,...,N 
0 otherwise (when any two indices are equal) 
or in other words the &7,..;, symbol vanishes unless all indices are differ- 


ent and is otherwise +1 depending on the permutation of the indices. The 
determinant of an N x N matrix Aj is 


det[A] = Ann Alo Jo ras LE Du HN (B.1) 
= Abn AP? see ANIN Et INEJ JN (B.2) 
—Audviea i NEL d (B.3) 
= A11 422433 s Ann EE A12 421 A33 m ANN sets (B.4) 


The ordering of the indices of £7,...;,, matters but (as with the Kronecker 
delta) they can be raised or lowered without penalty except in the case 
where the space under consideration is spacetime, in which case the metric 
raises/lowers indices. In general, the indices in the Levi-Civita symbol take 
on the values and notation of the indices in the space under consideration. 

For example, in spacetime the indices of the Levi-Civita symbol would be 
Greek indices running over the values 0, 1, 2,3, and we have 


gives — gP g> g GF Epor (B.5) 


uvaB _ J —1 (if uva is an even permutation of 0123) (B.6) 
2 ~ | +1 (if uva is an odd permutation of 0123) j 
g9123 = -1 £0193 = +1 

_ [44 (if uva is an even permutation of 0123) (B.7) 
RSS za (if uva B is an odd permutation of 0123) ` 
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The following identities hold in spacetime: 


pae ccm —ó76200 — 620267, = 70762 
+57 552 + 006207, + 09 020% 
EMB nse = —2 (6268 — 5262) 
gh aee = —652 
ctv Be ep = —24 
In three dimensions 
ir _ J +1 (if ijk is an even permutation of 123) 
ame | aes ee (if ijk is an odd permutation of 123) 


12 
E123 = €? = 41 
In two dimensions 


ij 202, J +1 (if ij is an even permutation of 12) 
i. n (if ijk is an odd permutation of 12) 


= (210) 


£12 = el? = +1 
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Dirac Delta- Functions 


'The Dirac ó-function is defined as 
0 A0 
d(x) = 
f lo daó(x) = 


and is mathematically a distribution, since it is undefined at x — 0. It is kind 


of a functional generalization of the Kronecker-delta. 
Any continuous function f(x) obeys the rule 


since 6(x) vanishes when x Æ 0. This means that in general 


Jarro) = F(a) 


which can easily be shown by writing x’ = x — a in the above integral. 


More generally 


no ->ý 


e 


with g'(z;) = da where 


q nm 


g(x;) =0 i=1l,...n 


are the places where the function g(r) vanishes. This is straightforwardly 


shown by letting y — g(x), so that dx — 265 and y = 0 when z = zi. 


The 


range of integration flips sign if g'(z;) < 0, which gives a negative value for 
the answer in all such cases, hence leading to the absolute-value |g’ («;)| in the 


denominator. 
The notation 6" refers to a product over n variables: 


5) (x) = 0(24)6(22) +++ (24) 
so that 


5 (p — p) = &((p — p)1)9((p — p')2)5((p — p’)3)5((p — p')a) 
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The 6-function can also be understood as the derivative of the Heaviside 
step function O(x): 
0 x<0 
O(z) = 
1 r0 


We can see this as follows. Instead of integrating the ó-function over all values 
of x, integrate it up to an arbitrary value of x. This gives 


i 0 r«0 
f dyó(y) = 
— oo 1 


because clearly if x < 0 then the ó-function vanishes everywhere in the range 
of integration, and if x > 0 we have included the singular point x = 0, and so 
we can continue to integrate out to x = oo (since 6(x) = 0 for x > 0), yielding 
a value of 1 for the integral. Consequently 


r0 


' mys eu ud 9 oan 


D 


Pauli and Dirac Matrices 


D.1 Pauli Matrices 


'The 3-Pauli matrices are Hermitian, unitary, traceless 2 x 2 matrices: 


eno (DES) 


We don't distinguish between upper and lower indices, so that 


c! 01,0? 05,0? 03 


We have the product rule 
0j0j = o1 + 16: jkOk = bi + ÍCijkO 


where the 2 x 2 unit matrix J is often suppressed, as in the second part of the 
expression above. This rule implies 


(01) = (e: = (03) — 1 
70102 = iog and cyclic (D.1) 


[ci, 9] = ZERO [0,, 05) = 284; 
and for any two vectors à and b: 
(a. &) (5-5) =a-b+i(ax b) a 


We also have the exponential relation 


a aa : 
exp [ë a] = Y 2 =c0s0 + if - sin 


n! 


where 6 = 60. 
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D.2 Dirac Matrices 


The Dirac y-matrices are four unitary traceless 4 x 4 matrices defined by the 
relation 


tr, y) = 29"" 


and are most commonly represented in the form 


where T is the 2 x 2 identity matrix and the c? are the Pauli matrices. 
The indices of the Dirac matrices are raised/lowered using the metric 


W= Iw  ——0—Y SY 
For any 4-vector a^ we have 
d= a^, = ayy" 
and the “bar” of any 4 x 4 matrix M is 
M =P Mta? 


From the Dirac matrices the additional matrices 


l 0I 
a ed lyra | a) (D.2) 
v i V 
De a qr) (D.3) 
i i ( Q0 o? s d c^ 0 
yo EE e 5) 2% = See (q ^) (D.4) 


are in common usage, y? for parity transformations and helicity projections 
and >*” for Lorentz transformations. We also have 


(4,49) 20 


D.3 Identities and Trace Theorems 


'The Dirac matrices obey the following useful identities 
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Ya yy y? = Ag"" 
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W =4 
Yat y* = —24 


ad^ = 4a - b 


Ya yy ye = 29 y^" Yap” = —244) 


where the bar of a matrix I is T = 49T!4?. These can be used to prove the 
following trace theorems 


Tr 


Tr 


N=4 Tr [odd # of y-matrices] = 0 
y] = 4g"" Tr [4] =4a-b 
[yy gea] = 4 (g gP gto g^ + gge) 


Abed] =4(a-bc-d—a-cb-d+a-db-c) 
[y] =0 Tr [yyy] = 0 


[o9 yy" yo? ] = Aero? Tr [dped] = dict? a,b, Cade 


E 


Cross-Sections and Decay Rates 


E.1 Decays 


For a single particle of 4-momentum p” decaying into m particles of 4-momenta 


pi ---p/! via the process 


1—1'-2'c..m 
the decay rate is 


Sc 
2hE 


dU = 


7 c(Ap/)' / / 5(4) 
II TONS G^ m IM p) (2m)* Yu 
i=l i 


where E is the energy of the decaying particle . If this particle is at rest and 
has mass M then E = Mc?. The quantity S is a statistical factor — it equals 
1/n! for each group of n identical particles in the final state. 

If m — 2 then there are only 2 particles in the final state, in which case the 
decay rate simplifies to 


S |p'|c 
sss = Ed AI IM]? 
8a (Mc) 


when the initially decaying particle is at rest, and where p’ is the momentum 
of either of the outgoing momenta in the final state, with (p! --- p, |M| p) = M 


cy M4 + mi, + m4, — 2m2,m3, — 2M?m$, — 2M?m3, 
2M 
and S = 1/2! if the outgoing particles are identical. 


Ip'- 


E.2  Cross-Sections 


For two particles of 4-momenta p} and p$ that collide and produce m particles 
of 4-momenta p!" -- - p/^ via the process 


14+2—31'42'+---m! 
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the cross-section is 


128 rr c(Ap))? 


1 1 2 
5 | P1 Pm M pipo) (E.1) 
i=l 2E; (27) 


do = 


Ay (mi: pa)” — pipe 
x (22)* 54 (Sn — pa -») (E.2) 
i=l 


where now (p! --- p”,, |M| pipe) = M and as before S is a statistical factor that 
equals 1/n! for each group of n identical particles in the final state. 


E.2.1 2-Body CMS 


If m — 2 then there are only 2 particles in the final state, and in a reference 
frame where the spatial momenta of the colliding particles are equal and 
opposite (the CMS, or center-of-momentum system) so that pı = —p», the 
cross-section simplifies to 


(2) al SIM? |p" 
dQ 2-Body CMS 87 (E, + Es)” |» 


where the outgoing momenta obey the relation 


and where 
(pi: p2)? — p2p2 = (E, + Es) [pr] /c 


E.2.2 2-Body Lab Frame 


By definition the lab frame is the frame where one particle is at rest, which 
can be taken to be particle # 2 so that p2 = 0 and so Ez = Mac?. 

In general, the formula is messy, even if there are only two particles in the 
final state. However, if the two outgoing particles are massless (My = M» = 
0) then the cross-section simplifies to 


(=) 7 ( h ) SIME Ig 
do 2-Body inelastic-LAB 8m M3 [pi | [(Eı t Mac?) - [pi | CCOS 6] 


If the particles scatter elastically, so that the outgoing particles are the same 
as the two particles in the initial state (1 + 2 — 1 + 2) then the differential 
cross section simplifies to 


[e y h ) S|MP Ii 
do 2-Body elastic-LAB 87 M3 [pi | [lp] (Er ai Mac?) = [pi | Ei Cos 0] 
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If Mı = 0 (so that the incoming particle is massless) then this formula 
further simplifies to 


NEC 
dQ 2-Body elastic-LAB 81 MacE, 


Alternatively, if Mac? >> E, (when the target recoil can be neglected), it 
y. 8 


simplifies to 
d E 
(is) lea) 
do 2-Body elastic-LAB 81 M»c 


and in both cases S = 1 because the two particles in the final state cannot be 
identical. 
In all of these cases 


(51: pa)” — p2p3 = Mac|pi| 


F 
Clebsch-Gordon Coefficients 


Clebsch-Gordon tables, reproduced here from the Particle Data Book [1], 
contain explicit formulae for all the CJ, ( j1, ja; m4 ma) coefficients given in 
eq. (5.29). The total spins (71, j2) being combined are given in the upper left 
of one of the sub-tables. The respective (m4, m2)-values (or z-components) 
of these spins are given in the lower-left boxes in a subtable, and the possible 
output |jm) wavefunctions are in the upper right boxes in the same sub- 
table. The Cj, 's are the square roots of the numbers in the relevant middle 
boxes, where the minus sign (if there is one) goes outside of the square root. 
Additional formulae are for lowest order spherical harmonics. 
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Fundamental Constants 


The constants of nature are taken from CODATA (Committee on Data for 
Science and Technology), and current values can be found at 


http:/ /physics.nist.gov/cuu/Constants/index.html 
and from the Particle Data Group, where current values are at 


http:/ /pdglive.lbl.gov/listingsl1.brl?exp- Y 
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Properties of Elementary Particles 


The following tables list some basic properties of the elementary particles — 
the gauge bosons, leptons, and quarks — as well as of some of the properties 
of the lowest-energy quark bound states (mesons and baryons). 
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Feynman Rules for the Standard Model 
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The Large Hadron Rap 


Lyrics by Kate McAlpine; used with permission 


Twenty-seven kilometers of tunnel under ground 

Designed with mind to send protons around 

A circle that crosses through Switzerland and France 

Sixty nations contribute to scientific advance 

Two beams of protons swing round, through the ring they ride 
"[il in the hearts of the detectors, they're made to collide 

And all that energy packed in such a tiny bit of room 
Becomes mass, particles created from the vacuum 

And then 


LHCb sees where the antimatter's gone 

ALICE looks at collisions of lead ions 

CMS and ATLAS are two of a kind 

They’re looking for whatever new particles they can find. 
'The LHC accelerates the protons and the lead 

And the things that it discovers will rock you in the head. 


We see asteroids and planets, stars galore 

We know a black hole resides at each galaxy's core 

But even all that matter cannot explain 

What holds all these stars together something else remains 
'This dark matter interacts only through gravity 

And how do you catch a particle there's no way to see? 
Take it back to the conservation of energy 

And the particles appear, clear as can be 


You see particles flying, in jets they spray 

But you notice there ain't nothin', goin' the other way 

You say, “My law has just been violated — it don’t make sense! 
'There's gotta be another particle to make this balance." 

And it might be dark matter, and for first 


Time we catch a glimpse of what must fill most of the known "Verse. 


Because 
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LHCb sees where the antimatter's gone 

ALICE looks at collisions of lead ions 

CMS and ATLAS are two of a kind 

They’re looking for whatever new particles they can find. 


Antimatter is sort of like matter’s evil twin 

Because except for charge and handedness of spin 
They’re the same for a particle and its anti-self 

But you can't store an antiparticle on any shelf 

Cuz when it meets its normal twin, they both annihilate 
Matter turns to energy and then it dissipates. 


When matter is created from energy 

Which is exactly what they'll do in the LHC 

You get matter and antimatter in equal parts 

And they try to take that back to when the universe starts 
The Big Bang back when the matter all exploded 

But the amount of antimatter was somehow eroded 
Because when we look around we see that matter abounds 
But antimatter's nowhere to be found. 

That’s why 


LHCb sees where the antimatter's gone 

ALICE looks at collisions of lead ions 

CMS and ATLAS are two of a kind 

They’re looking for whatever new particles they can find. 
'The LHC accelerates the protons and the lead 

And the things that it discovers will rock you in the head. 


'The Higgs Boson - that's the one that everybody talks about. 
And the one sure thing that this machine will sort out 

If the Higgs exists, they ought to see it right away 

And if it doesn't, then the scientists will finally say 

"There is no Higgs! We need new physics to account for why 
'Things have mass. Something in our Standard Model went awry." 


But the Higgs Istill haven't said just what it does 
'They suppose that particles have mass because 

There is this Higgs field that extends through all space 
And some particles slow down while other particles race 
Straight through like the photon it has no mass 

But something heavy like the top quark, it's draggin' its 
And the Higgs is a boson that carries a force 

And makes particles take orders from the field that is its source. 
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They'll detect it. 


LHCb sees where the antimatter's gone 

ALICE looks at collisions of lead ions 

CMS and ATLAS are two of a kind 

They’re looking for whatever new particles they can find. 


Now some of you may think that gravity is strong 
Cuz when you fall off your bicycle it don't take long 
Until you hit the earth, and you say, “Dang, that hurt 
But if you think that force is powerful, you're wrong. 
You see, gravity — it's weaker than Weak 

And the reason why is something many scientists seek 
They think about dimensions we just live in three 
But maybe there are some others that are too small to see 
It's into these dimensions that gravity extends 

Which makes it seem weaker, here on our end. 

And these dimensions are “rolled up” curled so tight 
That they don't affect you in your day to day life 

But if you were as tiny as a graviton 

You could enter these dimensions and go wandering on 
And they'd find you... 


I» 


When LHCb sees where the antimatter's gone 

ALICE looks at collisions of lead ions 

CMS and ATLAS are two of a kind 

They’re looking for whatever new particles they can find. 
'The LHC accelerates the protons and the lead 

And the things that it discovers will rock you in the head. 
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Particle physics is both mathematically and conceptually challenging, and 
many have thought that it can only be taught in a very superficial way at 
the undergraduate level, if it is taught at all. In my 20 years of teaching this 
subject | have found that students can indeed rise to the challenge if both 
the formalism and background are carefully explained to them in a manner 
that allows them to connect with the physics they have already learned. 

— Robert Mann 


An Introduction to the Standard Model of Particle Physics familiarizes 
readers with what is considered tested and accepted and, in so doing, gives 
them a grounding in particle physics in general. Whenever possible, Dr. 
Mann takes an historical approach showing how the model is linked to the 
physics that most of us have learned in less challenging areas. Dr. Mann 
reviews special relativity and classical mechanics, symmetries, conservation 
laws, and particle classification; then working from the tested paradigm of 
the model itself, he— 


e Describes the standard model in terms of its electromagnetic, 
strong, and weak components 


e Explores the experimental tools and methods of particle physics 


e Introduces Feynman diagrams, wave equations, and gauge 
invariance, building up to the theory of quantum electrodynamics 


e Describes the theories of the strong and electroweak interactions 


e Uncovers frontier areas and explores what might lie beyond our 
current concepts of the subatomic world 


Those who work through the material will develop a solid command of the 
basics of particle physics. The book does require a knowledge of special 
relativity, quantum mechanics, and electromagnetism, but most importantly 
it requires a hunger to understand at the most fundamental level why things 
exist and how it is that anything happens. This book will prepare students 
and others for further study, but most importantly it will prepare them to open 
their minds to the mysteries that lie ahead. Ultimately, the Large Hadron 
Collider may prove the model correct, helping so many realize their greatest 
dreams...or it might poke holes in the model, leaving us to wonder an even 
more exciting possibility: that the answers lie in possibilities so unique that 
we have not even dreamt of them. 
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